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CUSTOMIZED PROTEASES 

5 Background of the Invention 

Peptides used for pharmaceutical purposes 
will, in the future, more frequently be produced through 
the exploitation of genetic engineering. However / 
genetic engineering has limits to its capabilities. For 

10 example, expression of recombinant peptides bearing non- 
naturally occurring L-amino acids, D-amino acids, 
radioactive amino acids, and other detectable labels is 
not possible through recombinant techniques because 
there is no genetic code which codes for these 

15 modifications. In addition, naturally occurring amino 
acid modifications such as C-terminal amide group 
substitution, which are routinely performed in vivo , are 
difficult to perform in vitro . These post -translation 
modifications are important because they often result in 

20 the most potent or longest acting form of the peptide 
and constitute the naturally occurring form , of the 
peptide often needed for pharmaceutical use. 

There are techniques for modification of 
recombinant peptides. One such technique is C-terminal 

25 or-carboxyl amidation, as described by Bongers et al., 

Int. J. Peptide Protein Res.. i£:268 (1992) utilizing an 
or amidating enzyme as described in Henriksen et al . , J. 
Am, Chem. Soc , 114:1876-1877 (1992); and Ohsuye et al., 
Biochem. Bioohvs. Res. Commun. , 150:1275-1281 (1988) . 

30 However, these techniques are limited to those 

modifications for which there exists a natural enzyme or 
chemical method capable of performing the desired 
modification. 

Amidation of peptides has been performed 

3 5 through protease catalyzed replacement reactions 

(transpeptidation) using an amino acid amide or peptide 
amide as a nucleophile. Sahina et al. # Chem . Pharm . 
Bull. . 3£:4345-4354 (1988 ); Sahina et al . , Chem. Pharm. 
Bull. . 27:811-812 (1989); Breddam et al., J. Peptide 
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Prgtein Res. , 37:153-160 (1991). Yields using these 
techniques are typically quite low. However, 
transpeptidation reactions catalyzed by serine or thiol - 
proteases, under appropriate reaction conditions, have 
5 been carried out in high yields. Breddam et al. (1991) 
cited supra . Although protease catalyzed 
transpeptidation can be very effective under some 
circumstances, it is limited to substrates for which a 
natural protease exists and which exhibits specificity 

10 for a peptide bond close to the Oterminus. 

Hence, there is a need to provide mutant 
protease enzymes capable of performing heretofor unknown 
N- or C-terminal modifications as well as peptide chain 
elongation with a variety of substrates, especially 

15 those substrates that are not reactive with the 
naturally occurring protease enzyme. 

Summary of the Invention 
These and other objectives are provided for by 

20 the method of the invention. The invention provides 
customized proteases (i.e., mutant proteases), methods 
of making customized proteases, as well as methods of 
using customized proteases. 

Customized proteases are derived from known 

25 proteases such as endoproteases, exoproteases, serine 

proteases and cysteine proteases. A customized protease 
is a modified version of a known protease designed to 
provide a protease that is capable of transacylating a 
preselected substrate with a preselected nucleophile in 

3 0 a transacylation reaction not substantially catalyzed by 
the known protease . The mutant or customized protease 
can also exhibit improved or enhanced yields of 
transacylation. The preferred preselected substrates 
are peptides having an acidic or basic amino acid at the 

35 penultimate position. The preferred preselected 

nucleophiles are amino acids and amino acid derivatives 
such as amino acid esters and amino acid amides. 
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The invention also provides methods for 
preparing a customized protease. These methods can 
involve site specific mutagenesis or random mutagenesis. 
Site specific mutagenesis can involve replacing a 
5 selected amino acid in the active site with a selected 
amino acid or by replacing the selected active site 
amino acid with any one of the 20 amino acids randomly. 
Random mutagenesis can involve replacing any amino acid 
of the active site with any of the other 19 amino acids . 

10 One method of the invention involves providing 

a DNA sequence that encodes the known protease, 
modifying at least one codon for at least one amino acid 
in the active site to form a mutant DNA sequence, and 
transforming a suitable host cell with the mutant DNA to 

15 provide for expression of the customized protease. In a 
preferred version, the codon for the amino acid in the 
active site is deleted using restriction enzymes and the 
deleted codon is replaced with an oligonucleotide 
encoding a different amino acid residue. 

20 Another method of the invention involves 

modifying a DNA sequence encoding the known protease by 
inserting stop codons and/or a restriction enzyme 
recognition site at targeted sites to form a modified 
DNA sequence encoding an inactive protease. The 

25 targeted site preferably includes a codon for an amino 
acid in the active site which is replaced by the stop 
codon. A mutant DNA strand is synthesized and amplified 
by incubating the modified DNA strand in the presence of 
synthetic enzymes and oligonucleotides and a first 

30 degenerate oligonucleotide. The first degenerate 

oligonucleotide contains a codon for a different amino 
acid in the targeted site and in place of the amino acid 
in the active site in the known protease. The mutant 
DNA strand is then selected and screened by detecting 

35 the presence of the customized protease. 

The invention also includes a method of using 
customized proteases to modify a preselected substrate 
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by transacylation. The transacylation reaction 
catalyzed by the customized protease is preferably not 
substantially transacylated by the known enzyme. A 
customized protease ins incubated with a preselected 
5 substrate and a preselected nucleophile to form a 

mixture. The mixture is incubated sufficiently to form 
a preselected transacylation product , preferably in high 
yield. 

Customized proteases according to the 
10 invention can be utilized for post translation 
modification of recombinant peptides. The 
transacylation products produced are modified by 
substitution at the C-terminal or N-terminal end with 
various nucleophiles (including L-amino acids, D-amino 
15 acids, amino acid amides , amino acid derivatives, amino 
acid esters and radioactive amino acids or peptide 
derivatives including two or more amino acids of which 
the terminal amino acid is a natural amino acid or an 
amino acid derivative) . It is understood that peptides 
20 produced by means other than recombinant technology can 
be transacylated according to the method of the 
invention. 



Detailed Description of the Invention 
25 The invention provides customized proteases 

(i.e., mutant enzymes), methods of making the customized 
proteases, as well as methods of using the customized 
proteases . 

The customized proteases of the invention are 
30 derived from known proteases and have transacylation 
capabilities differing from the known proteases. 
Altered transacylation capabilities include the 
capability to perform transacylation reactions not 
substantially catalyzed by the known protease (i.e. 
35 yields less than 10%) or the capability to perform 

transacylation reaction with improved or enhanced yields 
(i.e., 80 to 100%) or both. The mutant customized 
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proteases have been modified so that the protease can 
perform transacylation reactions with different 
preselected substrates and/or different preselected 
nucleophiles than the known protease. The mutant 
5 enzymes can also be specifically designed and selected 
to perform transacylation reactions with a specific 
preselected substrate and/or nucleophile. A known 
protease, preferably an exopeptidase, can be customized 
by replacement of amino acids in the active site so that 

10 the customized enzyme can transacylate different 
preselected substrates and/or nucleophiles. The 
customized protease can also exhibit enhanced or 
improved yields of transacylation. Preselected 
substrates are preferably those that have an acidic or 

15 basic penultimate amino acid. Preselected nucleophiles 
are preferably acidic or basic amino acid amides. 

The present disclosure will utilize the 
following terminology. This terminology is applicable 
to proteases including both carboxy- and amino 

20 peptidases, as well as endopeptidases . Schechter et 
al. , (1967) . 

The amino acids of the preselected substrates 
are designaced by the letter "P". The amino acids of 
the substrate on the N- terminal side of the peptide bond 

25 to be cleaved by a protease enzyme are designated 

P n ...Pj, P 2 , P l# with P n being the amino acid furthest 
from the catalytic apparatus of the protease. Amino 
acids of the preselected substrate on the carboxy side 
of the bond to be cleaved by the protease are designated 

30 P x # f P 2 ' , P 3 '...P n ' with P n ' being the amino acid furthest 
from the catalytic apparatus of the protease. The bond 
which is to be cleaved by the protease is the Px-P/ 
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bond. Hence, the generic formula for the amino acids of 
the preselected substrate are as follows: 

carboxypeptidase P n P^P 2 -'9 1 -'P l ' 

5 amino peptidase Px-P/ -P a ' -P 3 # P n '-. 

endopeptidase P n P 3 -P 2 -Pi-Pi' -P 2 ' -P 3 ' P n ' 

The "active site" of the protease is divided 
into a number of substrate binding sites and a catalytic 

10 apparatus. For example, the catalytic apparatus of 
serine proteases such as carboxypeptidase Y has a 
conserved catalytic triad of amino acids including 
serine, histidine and aspartic acid. The binding sites 
of the enzymes can include the S 1 binding site, the S L ' 

15 binding site which includes the C binding site. For 
carboxypeptidases, the S 1 binding site binds the side 
chain of the penultimate amino acid of the preselected 
substrate (PJ , the S x ' binding site binds to the side 
chain of the carboxy terminal amino acid (P/), and the 

20 C binding site binds the terminal a-carboxylate group. 
Modification of the active site preferably includes 
changes to amino acids in one or more of the binding 
sites. 

The terminology for the substrate binding site 
25 of a protease is analogous to that for describing the 

amino acids of the preselected substrate except that the 
substrate binding sites of the protease are designated 
by the letter "S". The substrate binding sites for the 
amino acids on the N- terminal side of the cleaved bond 
3 0 are labelled as S n ...S 3 , S 2 , S x . The substrate binding 
sites for amino acids on the carboxy side of the cleaved 
bond are designated by "S' n . These are labelled as 
S l ' S 2 ' . . .S n ' . Also, analogous to terminology of the 
amino acids on the substrate, the catalytic apparatus of 
35 the protease is understood to exist between the S 1 and 
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substrate binding sites. Hence, a generic formula 
for describing substrate binding sites of a protease is: 

S n S 2 -S 1 -S l ' -S 2 ' S n ' . 

5 

The customized proteases of the invention are 
capable of transacylating substrates. As used herein, 
transacylation means that the enzyme can catalyze a 
reaction in which a leaving group is exchanged for a 

10 nucleophile. Transacylation reactions include 

transpeptidation reactions as well as peptide elongation 
reaction. Transpeptidation as used herein, occurs when 
single or multiple amino acids or amino acid derivatives 
act as a leaving group and the nucleophile is a single 

15 amino acid or peptide or amino acid derivative. Peptide 
elongation as used herein involves replacement of a 
single amino acid with a peptide. Transacylation 
reactions also include peptide elongation if the leaving 
group is an alcohol and the nucleophile is a single or 

20 multiple amino acid unit. 

Customized proteases according to the 
invention can be utilized for post translation 
modification of recombinant peptides. The 
transacylation products produced are modified by 

25 substitution at the C- terminal or N- terminal end with 
various nucleophiles (including L-amino acids, D-amino 
acids, amino acid amides, amino acid derivatives, amino 
acid esters and radioactive amino acids or peptide 
derivatives including two or more amino acids'* of which 

30 the terminal amino acid is a natural amino acid or an 
amino acid derivative) . It is understood that peptides 
produced by means other than recombinant technology can 
be transacylated according to the method of the 
invention. 

35 The method of the invention provides for 

production of customized proteases through the process 
of site specific and/or random site mutagenesis. The 
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invention further provides for selection and screening 
of suitably modified customized protease that is capable 
of catalyzing the preselected transacylation reaction. 
Alternatively, the method of the invention provides a 
5 means of producing customized protease through the 

process of traditional mutagenesis. The invention also 
provides a process for utilizing the customized protease 
to transacylate a preselected substrate to form modified 
peptide products . 

10 

A, Customized Protease 

The invention provides for customized 
proteases. The customized proteases are derived from 
known proteases and have altered transacylating 

15 capabilities differing from the known proteases. As 

used herein, a customized protease is a modified version 
of a known protease designed to provide a protease that 
is capable of transacylating a preselected substrate 
with a preselected nucleophile in a transacylation 

20 reaction not substantially catalyzed by the known 
protease (i.e., less than 10%). The mutant or 
customized protease can also exhibit improved or 
enhanced yields of transacylation. Improved or enhanced 
yields are yields preferably increased to about 40 to 

25 80%, preferably about 80 to 100% over the yields of the 
known enzyme. Known proteases preferably include serine 
proteases, cysteine proteases and other endo- and 
exopeptidases . The especially preferred proteases are 
serine carboxypeptidases . Protease enzymes are capable 

30 of hydrolyzing preselected substrates as well as 

transacylating substrates in which the scissile bond is 
an ester and/or peptide bond. 

The general reaction for transacylation is 
shown below: 
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AA X -AA 2 -X + N 
preselected nucleophile 
substrate 

Transacylation can occur at the N or C- terminal end of 
the preselected substrate. While not meant to be a 
limitation of the invention, it is believed that an 
enzyme that catalyzes transacylation preferably can bind 
to or otherwise accommodate the nucleophile. The 
desired product of the reaction is designated the 
transacylation product. 

The general reaction for hydrolysis is: 

exo-protease 

AA^AAj-X — > AA 1 -AA 2 -OH + X 

substrate H 2 0 hydrolysis leaving 

product group 

Yields of a desired transacylation product can depend on 
the (1) leaving group of the substrate, (2) the 
25 nucleophile, and (3) the rates of competing side 

reactions for the particular substrate and nucleophile. 
The competing side reactions can include (1) hydrolysis 
of the substrate, (2) hydrolysis of the desired 
transacylation product, (3) formation of various 
30 undesirable side products through transacylation, e.g., 
addition of nucleophiles to peptides originating from 
hydrolysis of the substrate, addition of nucleophiles to 
the desired transacylation product, polymerization of 
transacylation products, etc. Undesirable side products 
35 can include the hydrolysis product of the substrate, the 
hydrolysis product of the desired transacylation 
product, and transacylation product of the substrate 
with one or more of the hydrolysis products. The 
preferred customized protease of the invention can 
40 catalyze the desired transacylation reaction in high 
yields (i.e., preferably about 40 to 100V and more 



> AA 1 -AA 2 -N + X 
transacylation leaving 
product group 



10 



15 



20 
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preferably 80 to 100%) , does not form substantial 
amounts of undesirable side products, and has a high 
affinity for the nucleophile. The preferred customized 
protease also does not substantially form undesirable 
5 transacylation side products, especially the 

transacylation product of a reaction of the hydrolysis 
product of the substrate with the substrate. As used 
herein, "substantially" means that the undesirable side 
products preferably are about 0 to 40% of the yield and 

10 more preferably 0 to 20% of the yield and most 
preferably 0 to 5% of the yield- Some of the 
undesirable side products can be removed from the 
reaction mixture by modifying the side product with an 
antigenic capping agent and removing the capped products 

15 with an affinity column. Side products can be modified 
with antigenic capping agents in a manner similar to 
that described in U.S. Patent No. 5,049,656 issued 
September 17 , 1991, wherein a method for modifying 
undesirable side products in automated peptide synthesis 

20 is described. 

A preselected substrate is preferably not 
substantially transacylated by the known protease. The 
term substantially as used herein, means the yield of 
the transacylation reaction with the known protease is 

25 preferably about 0 to 40% and more preferably about 0 to 
10%. The preselected substrate can be a naturally 
occurring peptide, a recombinant peptide, a synthetic 
, peptide or a peptide in which the C- terminal a-carboxyl 
group has been esterified or otherwise modified. The 

30 preselected substrate has a core peptide connected to a 
leaving group at a terminus of the core peptide. The 
portion of the preselected substrate from which the 
leaving group is removed and to which the nucleophile is 
added is the core. Suitable leaving groups are amino 

35 acids, small peptides, or alcohols. The preferred 

leaving groups are small apolar or hydrophillic amino 
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acids as well as moieties linked to the peptide core by 
an ester bond. 

The suitability of the preselected substrate 
is dependent on the substrate specificity of the 
5 protease as well as on the nature of the leaving group 
and the desired final products. While not meant to be a 
limitation of the invention, it is believed that the 
suitability of the leaving group is dependent on (1) the 
desired modification of the preselected substrate; (2) 
10 the substrate specificity of the customized protease; 
and (3) the manner in which the leaving group binds to 
the customized protease. 

A suitable preselected substrate for 
transacylation using a customized protease, according to 
15 the method of the invention, is of the general formula: 

P-A 

Wherein P represents the N-terminal or C-terminal core 
of the preselected substrate and A is the leaving group. 

20 The leaving group (A) can be an amino acid, an amino 
acid amide, a peptide, a peptide amide, or an alcohol. 
If A is an amino acid, amino acid amide, peptide or 
peptide amide, cleavage of A, from the core (P) is at 
the peptide bond. If A is an alcohol, cleavage of A 

25 from the core is at the ester bond. Preferably, the 
preselected substrate is not a substrate that is 
transacylated by the known protease with high efficiency 
(i.e., yields of about 0 to 40%). More preferably, the 
preselected substrate has an acidic or basic penultimate 

30 amino acid. Suitable preselected substrates include 
growth hormone releasing factor (GRF) and derivatives 
thereof, calcitonin and derivatives thereof, and 
glucagon-like peptide-1 (GLP-1) (SEQ ID NO:l) . 
Preferred preselected substrates are GRF (1-43) -Ala (SEQ 

35 ID NO:2), GRF (1-24) (SEQ ID N0:3), and GLP-1 (SEQ ID 
NO:l) . 



WO 95/20039 



PCT/US95/06682 



12 

A nucleophile, as used herein, is a molecule 
that donates a pair of electrons to an atomic nucleus to 
form a covalent bond. A suitable nucleophile can be an 
amino acid derivative, peptide derivative, ammonia or 
5 labelled compound which can be added to the core of the 
preselected substrate by the customized protease capable 
of substituting the leaving group for the nucleophile. 
A suitable nucleophile can also include agents that can 
be converted to achieve the desired modification of the 

10 transacylation product. For example, photonucleophiles 
such as those described by Buckardt can be added to the 
substrate by transacylation with a customized protease 
and the resulting transacylation product can 
subsequently be converted to an amide by application of 

15 light. A suitable nucleophile can be preselected based 
upon (1) the desired modification of the final product; 
and (2) the ability of the nucleophile to displace the 
leaving group on the preselected substrate. The 
preferred nucleophiles include amino acids and amino 

20 acid derivatives such as amino acid esters and amino 
acid amides. 

Customized proteases are rendered suitable for 
a chosen transacylation reaction through modification of 
the known protease at the "active site." Modifications 

25 of the mutant or customized enzyme can be site specific 
mutations designed to alter the "active site" of the 
protease so that it can act upon different preselected 
substrates and/or nucleophiles than the known protease. 
Modifications can include substitution, deletion, or 

3 0 insertion of one or more amino acids. The modifications 
can also be generated by random mutagenesis. 

Some of the amino acids in the active sites of 
proteases are known to those of skill in the art . When 
the amino acids in the active site of a protease are not 

3 5 known, amino acids equivalent to amino acids in known 
binding sites of proteases can be identified using 
standard methods. These methods include identification 
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of equivalent amino acids by reference to the primary 
and/or tertiary structure of an enzyme in that class of 
proteases. For example, a reference enzyme for 
carboxypeptidases is wheat carboxypeptidase (CPD-WII) . 
5 The primary amino acid sequence and the crystal 

structure of CPD-WII are known (Liao and Remington, J. 
Biol. Chem. . 265:6528 (1990)) and can serve as reference 
points to identify equivalent amino acids in other 
carboxypeptidases . 

10 The amino acid sequence and crystallographic 

structure of CPD-Y are known as well (Endrizzi et al . , 
Biochemistry, 33 :11106 (1994)) and can similarly be used 
as reference points to identify equivalent amino acids 
in other carboxypeptidases. 

15 One method that is applied to identify 

residues in the active site of a protease with an 
unknown tertiary structure is comparison of the amino 
acid sequence of the protease of interest with the amino 
acid sequence of a homologous protease with a known 

20 tertiary structure. Thus, this method can be used to 
identify amino acids in the protease of interest that 
are equivalent to amino acids in the active site of the 
homologous protease. For example, see Olesen et al. 
Protein Engineering . £:409 (1993). Alternatively, the 

25 amino acids in the active site can be identified by 
determination of the tertiary structure using X-ray 
crystallography or NMR techniques. 

In a preferred version, carboxypeptidase Y is 
modified by substitution of amino acids in the active 

30 site. These amino acids are preferably found in the S x 
or S x ' binding sites. Preferred amino acids of the S 1 
binding site include Tyrl4 7, Leul78, Tyrl85, Tyrl88, 
Asn241, Leu245, Trp312, Ile340 and Cys341. Preferred 
amino acids of the S x ' binding site include Trp49, Asn51, 

35 Gly52, Cys56, Thr60, Phe64, Glu65, Glul45, Tyr256, 

Tyr269, Leu272, Ser297, Cys298 and Met398. Amino acids 
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equivalent to these residues of the binding sites of 
carboxypeptidase Y are also preferred. 

Amino acid substitutions in the S x binding site 
can preferably result in a mutant protease capable of 
5 transacylating a preselected substrate with 5 basic or 
acidic penultimate amino acid (PJ . Amino acid 
substitutions in the S x ' binding site can preferably 
result in a mutant protease capable of performing 
transacylation reaction on preselected substrates with 

10 large apolar amino acid leaving groups and/or 

nucleophiles such as large apolar amino acids, proline 
and proline amide. The especially preferred enzyme is a 
customized carboxypeptidase that has different amino 
acid residues in a position equivalent to amino acid 

15 residue 178 or 3 98 of carboxypeptidase Y. A preferred 
substituent amino acid is serine at position 178. 

The preferred customized protease is a 
carboxypeptidase that is capable of transacylating a 
preselected substrate having an acidic or basic amino 

20 acid such as growth hormone releasing factor with a 
C- terminal alanine (GRF (1-43) -Ala) (SEQ ID NO: 2) and 
arginine as the penultimate amino acid. The especially 
preferred mutant carboxypeptidase catalyzes formation of 
growth hormone releasing factor with a leucine amide. 

25 Selection can also involve choosing the 

different amino acid that will be substituted into the 
active site. While not meant to be a limitation of the 
invention, one way the amino acid to be inserted into 
the active site can be selected is by predicting the 

30 effect on the binding interaction between the 

preselected substrates and the customized protease. 
These substitutions can be conservative amino acid 
substitutions. 

In general, the binding interaction between 

35 the substrate binding sites of the proteases and 

preselected substrate and/or nucleophile can directly 
affect the transacylation process. Although not 
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intended to be a limitation of the invention, it is 
believed that the interactions between the amino acids 
of the binding sites and the amino acids of the 
substrate effect formation of the enzyme substrate 
5 complex (ES) which is a first step to transacylation. 
For serine or thiol proteases, enzyme substrate 
interactions lead to the formation of a tetrahedral 
transition state intermediate and subsequently to the 
cleavage of the peptide (ester) bond between the 
10 substrate core and leaving group. The resultant acyl- 
or thio-acyl species then undergoes nucleophilic attack 
(aminolysis) to form the transacylated product. 

The binding interaction involved in formation 
of the ES complex include three major types: 
15 electrostatic interactions, hydrogen bonds, and van der 
Waal forces. Amino acid substitutions within the 
substrate binding sites can cause changes in the 
transacylation capabilities of the protease through 
changes in the binding affinity of an enzyme for a 
20 substrate (i.e. formation of ES complex), through 

modification of the interaction of the enzyme with the 
transition state as well as through interactions 
affecting the rate of competing aminolysis and 
hydrolysis reactions. Amino acid substitutions that 
25 affect each of these stages can be predicted based upon 
the preselected substrate with leaving group and the 
preselected nucleophile. The preferred substitutions 
include the substitution of Asn51 with glutamine and 
Leul78 with serine in carboxypeptidase Y. 
30 The mutant or customized protease can also be 

a protease that exhibits enhanced transacylating 
capabilities. Enhanced transacylating capability can be 
determined by determining an increase in the yield of 
the transacylation product. Preferably, the increase in 
35 the yield is about 40 to 100% and more preferably about 
80 to 100% increase over the yield catalyzed by the 
known protease. The preselected substrate can either be 
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a substrate that can be transacylated by the known 
protease but at low yields (i.e., about 10-40%) or a 
substrate not substantially cleaved by the known 
protease (i.e., less than 10%). 

5 

B . Methods for Preparing Customized Protease 

The method of the invention provides for 
preparing a customized protease derived from a known 
protease and that has a modified active site and that 

10 functions to alter the transacylation capability of the 
known protease. As used herein, the customized enzymes 
can be modified to transacylate a new substrate by 
mutating one of more amino acids in one or more of the 
substrate binding sites of a known protease. It is 

15 possible that mutation of as few as one amino acid in 
one substrate binding site can provide for 
transacylation of a substrate that was not a suitable 
substrate for the known protease. The active site can 
also be modified to provide an enzyme having enhanced 

20 transacylation capability, i.e., higher yields of 
transacylation . 

Mutation of amino acids of the substrate 
binding sites can alter one or more functionalities 
which affect transacylation of a preselected substrate 

25 by a customized protease such as: (1) affinity of the 
customized protease for the core peptide portion of the 
substrate; (2) affinity of the customized protease for 
the leaving group or nucleophile; and (3) preference of 
catalysis of aminolysis over the competing hydrolysis 

30 reaction. 

a . Providing a DNA sequence of known 
proteases to be mutated 

According to a method of the invention, 

35 mutation of a known protease to produce a customized 

protease can be accomplished through site specific 

mutagenesis, random site mutagenesis and traditional 

mutagenesis. The first two methods require knowledge of 
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the DNA sequence of known proteases and the location of 
codons which code for the substrate binding site amino 
acids. Amino acids in the active site of proteases are 
either known to those of skill in the art or can be 
5 identified by analogy to known proteases as described 
herein. The corresponding DNA sequence encoding the 
known protease and the location of codons for amino 
acids in the active sites are either known to those of 
skill in the art or can be derived from the amino acid 

10 sequence. For example, the DNA sequence and 
restriction map of the PRC1 gene which encodes 
carboxypeptidase Y (CPD-Y) and a source of the DNA 
sequence is described in Vails et al., Cell . 48 : 887-889 
(1987) . DNA sequences encoding known proteases can be 

15 obtained from an electronic database such as SwissProt, 
GeneBank, and EMBL. Once these sequences are 
identified, publications identifying vectors containing 
the DNA sequence can be located and used by those of 
skill in the art to prepare a customized protease. 

20 If the DNA sequence and location of substrate 

binding sites is not available for a known enzyme, the 
putative DNA sequence can be derived from the amino acid 
sequence of the known protease. The amino acid sequence 
of the known protease can be used to prepare synthetic 

25 oligonucleotide probes. The probes can be used to 

identify DNA sequences encoding the known protease in 
suitable organisms by standard methods as described by 
Maniatis et al, A Guide to Molecular Cloning (1989) . 
Once a DNA sequence encoding a known protease is 

30 isolated, codons corresponding to amino acids in the 
active site can be identified as described herein. 
Active site amino acids can be identified by comparing 
the primary or tertiary structure to other known active 
sites of other proteases or by X-ray crystallography as 

35 described herein. 
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b. Modifying amino acids in the active site 

Amino acids in the active site can be modified 
by modifying the codon encoding the amino acids in the 
DNA sequence encoding the known protease. Amino acids 
5 in the active site and the location of codons encoding 
these amino acids are either known to those of skill in 
the art or can be determined using standard methods. 
Amino acids in the active site preferably include those 
found in the S 1# S x * or C binding sites. The codon or 

10 codons encoding amino acids of the active site of the 
protease are included in a targeted site on the DNA 
sequence. The targeted site includes the DNA sequence 
that is going to be mutated. One or more than one codon 
can be changed in the targeted site. Optionally, the 

15 targeted site can also include the DNA sequence 

surrounding the codon or codons for an amino acid in the 
active site. The DNA sequence of the targeted site 
surrounding the codon preferably includes about 3 to 9 
nucleotides on either side of the codon or codons for 

20 amino acids in the active site. Modifications of codons 
include substitution, insertion or deletion of the 
codon . 

The codons encoding amino acids of the active 
sites are preferably modified to encode a different 

25 amino acid than that of the known protease. In site 
specific modification, a selected amino acid in the 
active site can be changed either randomly or by 
substitution with a codon for a specific amino acid. In 
random site modification any number of codons can be 

30 modified by substitution with any number of amino acids. 
The modification of the codon results in a customized 
protease with altered transacylation capabilities. 

The preferred codons for modification are 
those that encode amino acids in the S x or S x ' binding 

35 sites of carboxypeptidases . While not meant to be a 
limitation of the invention, it is believed that 
modification of amino acids in the S x site can result in 
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a mutant protease capable of transacylating a 
preselected substrate with a basic or acidic penultimate 
amino acid (PJ . Likewise, it is believed that a 
modification of amino acids in the S x ' binding site can 
5 result in a mutant protease capable of transacylating a 
preselected substrate with an acidic or basic leaving 
group and/or amino acid amides as nucleophiles. 

Choice of a specific amino acid substitution 
at a random or specific location can be based on the 

10 known or inferred mechanism of interaction of the 

binding site amino acid and the substrate. From this a 
rational inference is made, using knowledge of the 
properties of amino acids, of what amino acid 
substitution will provide the appropriate interaction to 

15 effect transacylation of the preselected substrate. 

Known amino acid properties which may be considered when 
selecting specific amino acid substitutions include 
electronic and steric factors. For example, specific 
amino acid substitution selection may be based on pKa 

20 values (of a-carboxyl and side chain hydrogens) , amino 
acid side chain length, and amino acid side chain 
polarity at various pH. While not meant to be a 
limitation of the invention, the effect of the amino 
acid substitution can be predicted based upon the 

25 interactions involved in binding and catalysis as 

described herein for carboxypeptidase . Alternatively, 
if no data is available from which to make a rational 
inference of specific amino acid substitutions, random 
selection of amino acid substitutions can be made both 

30 with respect to the amino acids of the active site to be 
changed and the amino acid substitutions to be made. 

According to the method of the invention, site 
specific and random site mutagenesis are used to mutate 
the known protease and can be accomplished through 

3 5 incorporation of an oligonucleotide containing a mutated 
or modified codon at the chosen or targeted codon 
location. Other methods of random and site specific 
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mutagenesis can be employed as described by Maniatis, 
cited supra . Preferred methods of incorporation of the 
oligonucleotide into the DNA sequence encoding the known 
protease to produce a modified DNA sequence include 
5 polymerase chain reaction (PCR) and standard -'Cloning 
technology. 

Oligonucleotides containing a mutated or 
modified codon can be obtained by standard methods . 
These standard methods include synthesis by automated 

10 methods. Methods for automated DNA synthesis are known 
to those of skill in the art. The synthetic 
oligonucleotides are comprised of a variable and a 
constant region and preferably are about 20 to 60 
nucleotides long. The length of the oligonucleotide is 

15 dependent on two main factors; (1) the number of 

variable regions the oligonucleotide is coding for; and 
(2) the length of the constant regions. 

The variable region of the oligonucleotide 
contains the nucleic acid codons which code for the 

20 mutated amino acids of the substrate binding sites. The 
codons for amino acids are known to those of skill in 
the art . The variable region of oligonucleotide can be 
designed to include a codon for a specific amino acid or 
any number of random amino acids. Therefore, the 

25 minimum number of codons in the variable region is 
three, which represents the codon for a single amino 
acid. The codons of the variable region correspond to 
the location of the codons to be mutated in the known 
protease. The variable region is flanked by the 

30 constant region of the oligonucleotide. If an 

oligonucleotide contains more than one variable region, 
there are constant regions between variable regions. 
The constant regions are necessary to incorporate the 
oligonucleotide into the customized protease gene and 

35 include codons corresponding to those of the known 

protease at that location (i.e., that are not mutated). 
The length of the constant region can depend on the 
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means by which the oligonucleotide will be incorporated 
into the customized gene and the number of amino acid 
modifications included in the variable region. 
Preferably, the constant region includes about 3 to 50 
5 nucleotides on either side of the variable region, and 
more preferably about 3 to 3 0 nucleotides on either side 
of the variable regions. 

Once formed, the synthetic oligonucleotides 
are incorporated into the DNA sequence for the known 

10 protease in frame and at the targeted location. One way 
this insertion can occur is by cleavage with at least 
one appropriate restriction endonuclease so that the 
targeted site is deleted, followed by ligation of the 
synthetic oligonucleotide into the site that was 

15 deleted. Appropriate restriction endonucleases can be 
determined by examining the nucleotide sequence around 
the targeted site and by the size of the synthetic 
oligonucleotide to be inserted at the site. The 
recognition sequences of restriction enzymes are known 

20 to those of skill in the art, and an appropriate 

combination of enzymes can be readily selected by one of 
skill in the art. 

In a preferred version, the codon for AsnSl of 
carboxypeptidase Y is modified to encode glutamine 51. 

25 The PCR1 gene encodes carboxypeptidase Y and can be 
obtained from plasmid pTSY3 which has been deposited 
with the American Type Culture Collection in Rockville, 
MD on October 26, 1993 and given Accession No. 7558 0. 
An oligonucleotide including a codon for glutamine at a 

30 site corresponding to the codon for AsnSl such as: 

GGATCCGGTCATCCTTTGGTTGCA^GGGGGT (SEQ ID NO: 4) (oligo 
N51Q) 

35 (underline indicates changes to the codon 

at the targeted site) 



40 



can be synthesized by automated DNA synthesis. The 
BamHI fragment of PRC1 includes the codon for amino acid 
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51. The DNA sequence of carboxypeptidase Y surrounding 
the codon for Asn51 can be deleted from the BamHI 
fragment of PRC1 with restriction endonucleases such as 
BstXI and Smal. The synthetic oligonucleotide can then 
5 be introduced in place of the deleted DNA sequence at 
the Smal-BstXI site of the BamHI fragment. The modified 
BamHI fragment is then inserted back into the entire DNA 
coding sequence for carboxypeptidase Y using known 
methods to form a modified DNA sequence. The sequence 

10 of the modified DNA sequence can be confirmed using 
dideoxy sequencing methodology. 

Once the modified DNA sequence is obtained, it 
can be introduced into a suitable host cell, selected 
and expressed to yield the customized protease with the 

15 modified active site and that functions to alter the 
transacylation activity of the known protease. 

The modified DNA sequence is preferably 
incorporated into a vector to provide for selection and 
expression. Suitable vectors include the yeast 

20 bacterial shuttle vectors YEp24, pRA21ABAM, pYSPl, 

pTSY3, pRA21, and pYSP32. The modified or mutated DNA 
sequence can be incorporated into the vectors by 
standard methods as described by Maniatis et al., cited 
supra . and Nielsen et al., AppI . Microbiol . Biotechnol . , 

25 33 :307 (1990) . 

Once combined with a vector, the vector is 
introduced into a suitable host cell for selection and 
expression. Suitable host sells include bacteria such 
as E. Coli and yeast such as 5. cerevisiae . Preferred 

30 host cells include S . cerevisiae strains having isogenic 
vpl mutations, delta-prcl mutations and ura3 mutations. 
Especially preferred hosts are S. cerevisiae strains 
that have vpl mutations resulting secretion of active 
CPD-Y as described in Nielsen et al., cited supra . 

35 The preferred vector is a plasmid pTSY3 which 

is the yeast bacterial shuttle vector YEp24 with a 
3.2 kb DNA insert containing the PRC1 gene under the 
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control of the PRC1 promoter. This plasmid has been 
deposited with the American Type Culture Collection, 
Rockville, MD, and given Accession No. 75580. 

Suitable host cells are transformed by 
5 standard methods including transformation of-- calcium 

phosphate, calcium chloride or lithium acetate competent 
cells, microinjection, electroporation, and the like. 
Transformed cells cam be selected based upon the 
presence of antibiotic resistance genes on the vector in 

10 the case of E. coli and based upon the presence of URA3 
in the case of yeast. Transformed yeast cells can be 
screened for the production of mutant protease activity. 
Transformed yeast cells producing mutant proteases can 
be screened by detecting the ability of the transformed 

15 cells to hydrolyze a peptide substrate using standard 
methods as described by Nielsen et al., cited supra . 
Mutant proteases that can perform transacylation 
reactions with a preselected substrate and/or 
nucleophile can be further selected by assaying for 

20 transacylation activity by standard methods including 
those described in Examples 2 and 3 . 

Once the transformed cells are selected and 
amplified, mutant proteases can be purified using 
standard methods such as high performance liquid 

25 chromatography and affinity chromatography as described 
in Example 1. 

In an alternative version, a novel method of 
the invention involves mutagenizing a known protease to 
form a customized protease having altered transacylating 

30 capability- The basic technique of the method involves 
in vitro DNA synthesis primed by mutagenic degenerate 
synthetic oligonucleotides. The method provides 
efficient screening of a large population of mutant • 
transformants which eliminates the wild type background 

35 due to unmutagenized plasmids in the subsequent 
functional screens. 
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The steps of the method include providing a 
DNA sequence encoding a known protease. DNA sequences 
encoding known proteases are either publicly available 
or can be obtained by standard methods as described 
5 herein. 

Once the DNA sequence encoding the known 
protease is obtained, one or more targeted sites are 
selected. A targeted site of the DNA preferably 
includes at least one codon for an amino acid of the 
10 active site to be modified as described herein. Once 
targeted sites are identified, the DNA sequence is 
modified at each targeted site by insertion of stop 
codons and optionally restriction endonuclease sites at 
the location to be mutated. Codons for stop codons are 

15 designated amber, ochre and opal, and the sequences of 
the stop codons are known to those of skill in the art. 
The DNA sequences recognized by a restriction 
endonuclease are known to those of skill in the art. 
The DNA sequence including . the restriction endonuclease 

20 site can be adjacent to the stop codon or it can overlap 
with the stop codon. The oligonucleotide sequence 
inserted at the target site can be prepared by standard 
methods including automated DNA synthesis. 

The inserted oligonucleotide sequence is 

25 preferably about 3 to 60 nucleotides long and can be 
inserted into one or more targeted site using standard 
methods such as in vitro DNA synthesis as described by 
Maniatis et al # cited supra . Once the oligonucleotide 
having a stop codon and optionally restriction 

3 0 endonuclease site is inserted into a targeted site of 
the DNA encoding the known protease, a modified DNA 
sequence encoding an inactive known protease is formed. 
The presence of stop codons results in the expression cf 
truncated forms of the known protease lacking activity. 

3 5 The DNA sequence encoding an inactive known 

protease is introduced into a vector, preferably a 
phagemid vector. The vector is transformed into 
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suitable host cells such as E. coli for amplification. 
Once amplified, the vector is isolated and single 
stranded DNA can be prepared. Optionally, the DNA 
sequence can be introduced into a vector carrying an 
5 inactive antibiotic resistance gene such as a gene 
encoding ampicillin resistance that has a frameshift 
mutation. A preferred phagemid vector is the pYSPl. 

A mutant DNA strand encoding the customized 
protease can be synthesized by incubating the single 

10 stranded DNA with one or more first degenerate 

oligonucleotides in the presence of DNA polymerase and 
DNA ligase. A first degenerate oligonucleotide has 
variable and constant regions as described previously 
herein. The variable region of a degenerate 

15 oligonucleotide includes at least one mutated codon for 
an amino acid in the active site of the known protease 
and that has been targeted. The mutated codon is found 
at the same location with respect to the surrounding DNA 
as the codon for the amino acid in the known protease. 

20 An oligonucleotide is degenerate if the mutated codons 
are randomly changed to encode any one of the 20 amino 
acids. A degenerate oligonucleotide at the mutated 
codon has the sequence of NNN wherein N corresponds to 
any one of the four nucleotides. A degenerate 

25 oligonucleotide preferably includes about 10 to 50 
nucleotides on both sides of the mutated codon. The 
codons of the constant region correspond to the codons 
of the known protease at the targeted location in the 
known protease. Degenerate oligonucleotides can be for 

30 any of the 20 amino acids and are randomly generated 
using known methods and as described in Olesen et al . , 
cited supra . 

When the DNA sequence encoding the known 
protease contains more than one targeted site that has 

35 been modified by stop codons and/or restriction 

endonuclease sites, it may be desirable to synthesize a 
mutant DNA strand that is modified at only one or some 
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of the targeted sites- When mutation is desired only at 
some of the targeted sites having stop codons, the other 
targeted sites in the mutant enzyme should have the 
sequence of the known enzyme at these other targeted 
5 sites. One or more second oligonucleotides can be 

included in the DNA synthesis mixture that function to 
ensure that the other targeted sites that are not to be 
mutated in the mutant DNA strand have the sequence of 
the known enzyme as described in Olesen et al., cited 

10 supra . The second oligonucleotides include codons 

corresponding to those in the known protease that have 
been replaced by stop codons and optionally restriction 
endonuclease sites at targeted sites. Each second 
oligonucleotide has the same sequence at a specific 

15 targeted site as the known protease. A second 
oligonucleotide preferably has about 20 to 60 
nucleotides. 

The synthesis mixture can also optionally 
include an oligonucleotide that provides for repair of 

20 an antibiotic resistance gene. When the DNA sequence 
encoding the known protease is inserted into a vector, 
that vector preferably includes a mutated antibiotic 
resistance gene such as ampicillin resistance. 
Including one or more oligonucleotides that can repair 

25 the mutation of the antibiotic resistance gene in the 
DNA synthesis mixture results in a mutant DNA strand 
that has a functional antibiotic resistance gene. An 
oligonucleotide that can repair the antibiotic 
resistance gene includes codons that provide for the 

3 0 correct DNA sequence at the relevant mutation in the 
resistance gene. These oligonucleotides are known to 
those of skill in the art or can be prepared by standard 
methods as described in herein. 

Once synthesis of the mutant DNA is complete, 

35 desirable mutant enzymes encoded by the DNA can be 
selected and screened for the ability to act on a 
preselected substrate for alteration of transacylation 
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capability. The mutant DNA can be selected and 
amplified first in a suitable host cell such as EL. coli 
followed by introduction into a suitable host cell that 
can secrete proteases such as S. cerevisiae strains. 
5 Selection methods for transformed cells include 
selecting for antibiotic resistance based upon the 
presence of an antibiotic resistance gene on a vector. 
Methods for selection in yeast using selectable marker 
genes are known to those of skill in the art. 

10 Transformed cells can be screened to identify 

cells having the customized protease with the desired 
functional activity. The desired customized protease 
can include a protease that can hydrolyze a preselected 
substrate having an acidic or basic penultimate amino 

15 acid, a protease that can catalyze transacylation 

reaction in which a preselected substrate is modified by 
a preselected nucleophile, and a protease with enhanced 
transacylation capabilities. The preferred customized 
protease is a carboxypeptidase that can modify a 

20 preselected substrate with a basic penultimate amino 
acid with a terminal leucine amide. One or more 
screening and selecting methods can be employed. 

For example, mutant enzymes capable of 
hydrolyzing a preselected substrate are first selected 

25 and screened for hydrolysis activity and then those 
selected mutants are screened for transacylation 
activity. Substrate hydrolysis is used as a first level 
selection to ensure that the mutant is capable of acting 
on the preselected substrate. The mutant enzymes, 

30 capable of hydrolyzing the preselected substrate are 
then further screened for transacylating capability. 
Selection of transf ormants expressing hydrolysis 
activity of the preselected substrate can be performed 
using the plate activity method or the color overlay 

35 method. For detection of low level customized 
proteases, the color overlay method is preferred. 
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According to the invention, transformed cells 
expressing customized proteases can be selected for 
preselected substrate hydrolysis activity using a plate 
activity method. This method of the invention utilizes 
5 a bacterial or yeast host cell which requires an amino 
acid for growth. The amino acid required for growth is 
provided to the transformed host cells as a C-terminal 
amino acid of a peptide. By culturing the transf ormants 
on a media deficient in the leaving group amino acid, 

10 only those transf ormants capable of releasing the 

leaving group from the peptide substrate can grow. This 
method is described in Olesen et al., Protein Eng. . 
cited supra . For example, a vps strain of S. cerevisiae 
which requires Leu for growth is used to select for 

15 transformants expressing a mutant CPD-Y gene capable of 
cleaving a terminal Leu from a preselected substrate 
such as N-blocked-X-Leu-OH dipeptides {X « Glu, Gly, 
Ser, His, Pro, Trp, or Lysj . Only cells which express a 
protease which can release the terminal Leu from these 

20 substrates can grow on these plates. 

A preferred method fox* selection of 
transformants which express a customized protease 
capable of hydrolyzing a preselected substrate is the 
color overlay method. According to this method, 

25 trans formant colonies are overlaid by agar containing a 
chromogenic substrate which reveals customized protease 
activity. The chromogenic substrate turns color upon 
reaction with the product formed by catalytic action of 
the customized protease. In a preferred version, the 

30 transformed cells are incubated with a preselected 
peptide or amino acid substrate. The preselected 
peptide or substrate such as N-acetyl-L-alanine 
/3-naphthyl ester is acted upon by transformed cells 
expressing a customized protease and the reaction 

35 product is detected by simultaneously overlaying the 
transformed cells with a chromogenic agent such as 
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Garnet Red that changes colors upon exposure to the 
reaction products. 

Customized proteases which have been mutated 
and screened for the capacity to act on a preselected 
5 substrate can be further screened for transacylation 
capability. One method that can be employed to screen 
for transacylation capability is that described in 
Examples 2 and 3. Customized proteases are purified 
from the transformed cells by methods known in the art 

10 and as described in Examples 2 and 3. The purified 
customized protease is mixed with the preselected 
substrate under conditions favorable for transacylation 
such as pH of about 7 to 9.5 and in the presence of a 
suitable nucleophile. Transf ormants which produce 

15 customized proteases capable of catalyzing 

transacylation of the preselected substrate can be 
identified by following the appearance and/or amount of 
the desired reaction product by standard methods. 

In a preferred version, the PRC1 gene encoding 

20 carboxypeptidase Y is modified by insertion of DNA 
sequences at target sites as follows: 

TAA GCT TCC (SEQ ID NO: 5) at Tyrl47 ochre stop codon 

Hindlll 

25 

TGA ATT CCT (SEQ ID NO: 6) at Leul78 opal stop codon 

EcoRI 

TAG CCC GGG TGT (SEQ ID N0:7) at Glu2l4, Arg216 
30 amber stop codon 
Smal 

TAA ATC GAT ACC (SEQ ID NO:8) at Ile340, Cys341 
ochre stop codon 
35 Clal 



The target sites in carboxypeptidase were selected based 
on homology to carboxypeptidase -WI I . The modified DNA 
40 sequence encodes an inactive carboxypeptidase Y. The 

DNA sequence as modified is amplified as phagemid vector 
PYSP32 in E . cqlj. The preferred vector also contains 
an inactive frame shifted ampicillin resistance gene. 
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Single stranded DNA of pYSP32 can be generated by 
standard methods. 

The single stranded DNA can be incubated with 
up to four first degenerate oligonucleotides, each 20 to 
5 60 nucleotides long and containing degenerated codons at 
the center. A preferred first degenerate 
oligonucleotide for position 178 includes the sequence: 
GGC NNN ACT 

wherein NNN is situated at the center of the 
10 oligonucleotide and corresponds to the location of the 
codon for amino acid 178. NNN is a codon for any of the 
20 amino acids. Degenerate oligonucleotides can be 
synthesized by automated synthesis . 

One or more second oligonucleotides can be 
15 included in the preferred synthetic mixture and in which 
case they each replace one of the first 
oligonucleotides. The second oligonucleotide includes 
codons for amino acids at positions 147 , 178, 215 and 
216, and 340 and 341 of carboxypeptidase Y, as follows: 
20 TCC TAC GCC (SEQ ID NO: 9) Tyrl47 

TTG GAA CGT TGT (SEQ ID NO: 10) Glu215, 

Arg2l6 

TTC ATC TQT ACC (SEQ ID NO: 11) Ile340, 

Cys341 

25 GGC CTC ACT (SEQ ID NO: 12) Leul78 

The presence of these second nucleotides ensures the 
mutant DNA strand is synthesized without stop codons at 
other targeted sites. One preferred mutagenesis mixture 

30 contains a first degenerate oligonucleotide for position 
178 and a second oligonucleotide for each position of 
Tyrl47, Glu215, Arg216, Xle340 and Cys341. 

Optionally, the preferred DNA synthesis 
mixture also contains one or more third 

35 oligonucleotides, ampicillin repair oligonucleotides. 
The repair oligonucleotides provide for synthesis of a 
mutant DNA sequence having a functional ampicillin 
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resistance gene operably linked to the DNA sequence 
encoding the mutant or customized protease. 

The mixture of single stranded DNA and 
oligonucleotides is incubated in the presence of 
5 precursor nucleotides, DNA polymerase and DNA ligase. 
Mutant DNA sequences encoding the customized protease 
are formed and can optionally be linked to a functional 
ampicillin resistance gene. The mutant DNA is 
introduced into a suitable mismatch repair deficient 

10 bacteria for amplification and selection and 

subsequently introduced into yeast for screening. The 
preferred suitable yeast cell is a S. cerevisiae strain 
that has a vps mutation, and/or requires at least one 
amino acid for growth. Especially preferred 

15 cerevisiae strains include W2579, K2579LLR and JHRY20- 
2CA3 . 

The transformed E. coli cells can be selected 
first for the ability to grow in the presence of 
ampicillin. Subsequently, transf ontiants are further 

20 selected for the ability to grow on a medium deficient 
in the amino acid required for growth and in the 
presence of the peptide substrate containing the amino 
acid for growth at the C- terminus. For example, a host 
cell that cannot normally grow without leucine is 

25 transformed with mutant DNA and plated onto leucine 

deficient medium supplement with a preselected peptide 
having the following formula: 

where A is leucine and P x is an amino acid not 
3 0 sufficiently acted on by known protease such as any one 
of Glu, Gly, Ser, His, Pro, Trp or Lys. Transf ormants 
that can grow on leucine deficient medium supplement 
with a preselected polypeptide such as P n -Lys-Leu can act 
on the preselected polypeptide to release leucine, 
35 thereby providing a source of leucine for growth. These 
expressed proteases are then screened for the ability to 
transacylate a preselected polypeptide such as P n -Arg-Ala 
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where P n is growth hormone releasing factor and leucine 
amide . 

Customized proteases which favor aminolysis 
over hydrolysis cam be produced using traditional 
5 mutagenesis. According to this method of the invention, 
mutation of the amino acid composition of a known 
protease is accomplished by subjecting DNA or cells 
containing a vector encoding a DNA sequence for known 
protease to a mutagenic agent such as UV light, 

10 nitrosoguanidine, ethylmethyl sulfonate, bisulfite, 

dimethyl sulfate, formic acid, hydrazine, hydroxylamine, 
methoxylamine, nitrous acid, potassium sulfate, and 
others. Methods of traditional mutagenesis are known in 
the art and are described, for example: for chexnicai in 

15 vi tro mutagenesis: Myers et al., Science . 229 :242-247 
(1985); Hayatsu, Methods Enzvmol . , 15:568-587 (1976); 
Shortle et al., Methods Enzvmol . . 100:457-568 (1983); 
Kadonga et al., Nucl . Acids . Res . . 13:1733-1745 (1985); 
Busby et al., J. Mol . Biol.. 154:197-209 (1982); and 

20 Loeb, Cell . 4£:483-484 (1985); for nucleotide 

mis incorporation: Romberg, DNA Replication , Freeman, 
San Francisco (1980) at page 724; and Kunkel et al . , J. 
Bjol. phem., 251:5718-5725 (1979); for incorporation of 
nucleotide analogs: Flavel et al., J. Mol . Biol . . 

25 £2:255-272 (1974); Dierks et al., Cell . 22:659-706 

(1983)1 Dodson et al . , PNAS , 22:7440-7444 (1982); Eadie 
et al., Nature , 303. = 201-203 (1984); Grossberger et al., 
MS/ 21:5689-587 (1981); and Mott et al., Nucl. Acids 
Res. . 1^:4139-4152 (1984). 

30 The mutated vectors are then incorporated into 

a suitable expression system and the expressed 
customized enzymes are selected and screened. The 
methods for selecting and screening customized protease 
produced by traditional mutagenesis can be performed as 

3 5 described above for transformants produced by site 

specific and random site mutagenesis. Plate activity 
and color overlay selection can be utilized to select 
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for those transf ormants which express a customized 
protease capable of acting on a preselected substrate. 
Those customized proteases capable of acting on a 
preselected substrate are purified and mixed with a 
5 preselected substrate and nucleophile under., conditions 
favorable for transacylation to screen for enzymes 
capable of catalyzing the transacylation of the 
preselected substrate. 

10 c. Method of traneacylating substrates 
uiHnq rMiqt-rmH ze d proteases . 

The invention also provides a method for using 
the customized enzyme to transacylate a preselected 
substrate with a preselected nucleophile. This method 

15 is useful to add nucleophiles such as D-amino acids, 
modified amino acids and radiolabelled amino acids to 
the termini of recotnbinantly produced peptides to form 
transacylation products. This method can also be 
applied to naturally occurring or synthetic peptides to 

20 form useful analogs or derivatives. 

The customized protease of the invention is 
designed to either enhance transacylation capabilities 
(i.e., yields) or act on a preselected substrate and/or 
nucleophile poorly accepted by the known proteases. The 

25 customized protease can be prepared and selected by the 
methods described herein. The preselected substrate is 
selected depending on the desired transacylation 
product. The preselected substrate is preferably not 
substantially transacylated by the known proteases. 

30 "Substantially" in this context means that the yields of 
transacylation with the preselected substrate and with a 
particular nucleophile are preferably about 0 to 10% 
under standard conditions. The preselected substrate 
preferably has a basic or acidic amino acid as the 

35 penultimate amino acid. The preferred amount of the 
preselected substrate depends on the substrate 
specificity of the enzyme for the substrate as well as 
the solubility of the substrate in the chosen solvent 
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and is about 0.2 to 10 mM. A preselected substrate can 
be a naturally occurring peptide, a synthetic peptide or 
a recombinantly produced peptide. 

The preselected nucleophile is preferably not 
5 an effective nucleophile with the known protease. The 
nucleophiles are preferably amino acids, radioactively 
labelled amino acids, and amino acid amides. 
Nucleophiles can be prepared by standard synthetic 
methods known to those of skill in the art such as 

10 described in Breddam et al., Int. J. Peptide Res. . 

32:153-160 (1991) . A preferred amount of a nucleophile 
also depends on the affinity of the enzyme and 
solubility of the nucleophile in the chosen solvent and 
is about 10 mM to 2M. 

15 Reaction conditions resulting in high yields 

of the desired product can vary with a given enzyme 
substrate system. Reaction conditions can be altered to 
minimize degradation and polymerization of the products. 
Such side reactions may, when using ester substrates 

20 together with a serine carboxypeptidase, be avoided by 
increasing the pH above 8.0 when aqueous solvents are 
employed. Alternatively^ side reactions can be avoided 
by conducting the reaction in am organic solvent. 

Transacylation can be performed in aqueous 

25 buffer solution. Preferred buffer solutions include 
50mM HEPES and 5mM EDTA, pH 7.5 or 50mM CHES and 5mm 
EDTA, pH 9.5. It is of importance that the chosen 
buffer is unable to act as a nucleophile in the 
transacylation reaction. The preferred pH for 

30 transacylation using an alcohol leaving group is 

preferably about pH 6.5 to 9.5 and more preferably pH 
7.5 to 8.5. The preferred pH for transacylation using 
an amino acid or peptide derivative leaving group is 
preferably about pH 5.5 to 8.5, and more preferably 

3 5 about preferably pH 6.5 io 7.5. 

The production of the transacylation product 
is monitored by HPLC or other appropriate analytical 
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technique. The reaction can be stopped by addition of 
an acidic solution to bring the pH of the reaction 
mixture down to about pH 1 to 3 . Alternatively, the 
reaction can be stopped by addition of an enzyme 
5 inhibitor such as phenyl methane sulfonyl fluoride 

(PMSF), or diisopropyl phosphoryl fluoridate (DFP) . The 
transacylation product can be separated from the 
reaction mixture by reverse phase chromatography, 
hydrophobic interaction chromatography, ion exchange 

10 chromatography, or HPLC. 

Alternatively, the transacylation reaction can 
be performed in organic solvents for those enzymes 
substrate systems capable of functioning in organic 
solvents. Suitable organic solvents for the 

15 transacylation reaction include dimethyl sulfoxide 
(DMSO) , N,N' -dimethylacetamide and other similar 
solvents. The methodology for transacylation in organic 
solvents is described in Bongers et al., Int. J. Peptide 
Protein Res. . 40:268 (1992) . 

20 In a preferred example for transpeptidation 

using an amino acid leaving group in aqueous solution, 
the preselected peptide substrate, GRF (1-43) -Ala (SEQ 
ID NO: 2), is dissolved in a 5% solution of acetic acid. 
The nucleophile, leucine amide, is dissolved in 50 mM 

25 HEPES , 5 mM EDTA to a final concentration of 500 mM. 
25 /il of a 40 mM solution of GRF (1-43) -Ala (SEQ ID 
NO: 2) is added pr. 950 /xl of nucleophile solution and 
the pH is add to 7.5 at 20°C. The customized protease 
is added to the mixture in 25 /il of water pr. ml 

3 0 solution, resulting in an enzyme concentration of about 
0.002 to 0.07 mg/ml. The reaction is followed by HPLC 
and is stopped when no additional product is formed by 
the addition of one volume of 2.5% trif luoracetic acid. 
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EXAMPLE 1 

Site Specific Mutagenesis of 
Carboxvpeptidase Y Active Site 

The three-dimensional structure of 
carboxvpeptidase Y suggests that the side chains of 
Trp49, AsnSl, Gly52, Cys56, ThrSO, Phe64, Glu65, Glul45, 
Tyr256, Tyr269, Leu272, Ser297, Cys298 and Met398 are 
important in the active site of the enzyme. These amino 
acid residues were mutated by site specific mutagenesis 
to form enzymes with single, double, or triple 
mutations . 

The following mutants of carboxvpeptidase Y 
have been constructed using standard methods: 



AsnSl has been replaced with: 



Glu 6 5 has been replaced by: 
Glul45 has been replaced by: 



Alanine (N51A) 
Cysteine (N51C) 
Glycine (N51G) 
Valine (N51V) 
Aspartic Acid (N51D) 
Glutamic Acid (N51E) 
Glutamine (N51Q) 
Serine (N51S) 
Threonine (N51T) 

Alanine (E65A) 
Glutamine (E65Q) 

Alanine (E145A) 
Glutamine (E145Q) 
Serine (E145S) 
Asparagine (E14 5N) 
Aspartic Acid (E145D) 



35 



Trp4 9 has been replaced by: Phenylalanine (W4 9F) 



In addition, enzymes having more than the 
mutation as shown below have been constructed: 
40 N51A + E145A 

E65A + E145A 
N51A + E65A + E145A 
E65Q + E145Q 

The general methodology used to construct the mutant 
4 5 enzymes is by site specific mutagenesis using the 
polymerase chain reaction. 
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For example, a plasmid pUC-of3 0 was constructed 
by inserting a 1112 bp BamHI fragment of the PRC1 gene 
from pYTS3 containing the coding region for all amino 
acid residues involved in the formation of the active 
5 site into the unique BamHI site in the polylinker of 
pUC19. Yanisch-Perron et al., Gene , 33:103 (1985); 
Stevens et al. Yeast Cell Biolocrv . Ed. J. Hicks, New 

York, Alan R. Liss, at pages 519-536 (19 ) . The 

orientation of the fragment was opposite of the lacZ 

10 gene. pUC-a3 0 contains unique BstXI, EcoRI, Nael and 
Smal restriction sites which can be used in cloning and 
mutagenesis procedures. 

The mutations W49F and N51A were made by the 
polymerase chain reaction (PCR) (Innes et al., 1990) in 

15 a Perkin Elmer Cetus DNA Thermal Cycler using a Gene Amp 
kit (Perkin Elmer Cetus) on pUC-a30 with 
GTTTCTGTCCTTGTGAGACAAAATTTCAGA (SEQ ID NO: 13) (oligo 
wtll) and with either GGATCCGGTCATCCTTTTCTTGAACGGG (SEQ 
ID NO: 14) (oligo W49F) or 

2 0 GCAAAGGATCCGGTCATCCTTTGGTTGGCAGGGGGCCA (SEQ ID NO: 15) 

(oligo N51A) . Nucleotides underlined are different from 
wild-type. Cleavage with BstXI allowed insertion of the 
PCR fragment into a Smal -BstXI vector fragment of 
pUC-c*30. 

25 The mutation E145A was made by PCR with 

GCAAGGCGATTAAGTTGGGT (SEQ ID NO: 16) (oligo pUC19 spl) 
and GGCGTAGGAAGCCCCAGCGAT (SEQ ID NO: 17) (oligo E145A) 
on pUC-a30. Cleavage of the PCR fragment with EcoRI 
allowed introduction into a Nael -EcoRI vector fragment 

30 of pUC-a30. 

The mutations E65A and N51A+E65A were produced 
by fusion of two overlapping PCR fragments using either 
pUC-a30 or pUC-a30-N51A as template. Fragment 1 was 
generated with CTGTTCTTTGCATTAGGACCC (SEQ ID NO: 18) 

35 (oligo E65A) and (oligo wtl) and fragment 2 with (oligo 
pUC19sql) and (oligo E145A) . An additional PCR reaction 
was performed on the fused fragment with oligo pUC19sql 
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and oligo wtl. The resultant fragment was cut with 
EcoRI and BstXI, thus, removing the unwanted mutation on 
position 145, and ligated into a pUC-of3 0 vector fragment 
cut with the same restriction enzymes. 
5 N51A+E145A, E65A+E145A, and N51A+E65A+E145A 

were made by proper combination of the above listed 
mutations exploiting the EcoRI site in the polylinker 
and exploiting that BstXI cleaves between position 65 
and 145. 

10 The mutated sequences were introduced into the 

PRC! gene by transferring the mutated 1112 bp BamHI 
fragment into the vector pRA21ABam. The fragment 
inserted into pRA21ABam was controlled for the absence 
of any non-silent secondary mutation by sequencing using 

15 the Taq Dye-Dideoxy™ terminator cycle sequencing kit and 
the model 373A DNA- sequencing system from Applied 
Biosystems, USA. 

Site directed mutagenesis on position 51 and 
145 was performed using polymerase chain reaction and 

20 restriction endonuclease cleavage as described herein. 
The following oligonucleotides were used in the 
mutagenesis reactions : 

GCAAAGGATCCGGTCATCCTTTGGTTGGACGGGGGCCA (SEQ ID NO: 19) 
25 (oligo N51D) , 

GGATCCGGTCATCCTTTGGTTG£AAGGGGGCCA (SEQ ID NO: 20) (oligo 
N51E) , 

30 GGATCCGGTCATCCTTTGGTTGCAGGGGGT (SEQ ID N0:21) (oligo 
N51Q) , 

GGATCCGGTCATCCTTTGGTTGAGTGGGGGT (SEQ ID NO: 22) (oligo 
N51S) , 

35 

GGATCCGGTCATCCTTTGGTTGACTGGGGGT (SEQ ID NO: 23) (oligo 
N51T) , 

GGCGTAGGAATCCCCAGCGAT (SEQ ID NO:24) (oligo E145D) , 

40 

GGCGTAGGAATTCCCAGCGAT (SEQ ID NO: 25) (oligo E145D) 

GGCGTAGGAXXSCCCAGCGAT (SEQ ID NO:26) (oligo E145Q) , 

GG CGTAGG ATG AC C CAG CGAT (SEQ ID NO: 27) (oligo E145S) . 

45 
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Underlined nucleotides are different from wild-type. 
All fragments generated by the PCR reaction were ligated 
into pUC-a30 after cleavage with the appropriate 
restriction enzymes such as Eco RI (E145) or Bst XI 
5 (Asn51) . The absence of any non-silent secondary 
mutation was confirmed by sequencing using the Taq 
Dye-Dideoxy™ terminator cycle sequencing kit and the 
model 3 73A DNA- sequencing system from Applied 
Biosystems, USA. 

10 The mutant enzymes containing cysteine (N51C) , 

valine (N51V) , or glycine (N51G) , glutamine (E65Q) , and 
asparagine (E145N) were prepared in a similar manner. 

Once the DNA sequence encoding the PCR enzyme 
was mutated as described herein, it was transferred into 

15 the vector pRA21ABam . This vector was then introduced 
into S. cerevisiae strain (W2579 (prcALEU2 ura 3-51, 
pep 4-3 ) as described by I to et al . J. Bacteriol . . 
153 : 163-168 (1983) using lithium acetate. The stability 
of plasmids in stationary-phase yeast cells was 

20 determined by plating culture samples on YPD medium (1% 
yeast extract [Difco] , 2% peptone [Difco] and 2% 
glucose) and subsequent replica-plating of about 100 
colonies to SC medium lacking uracil. Yeast cultures 
were grown in MU/pro medium plus 50 mM potassium 

25 phosphate. 

Mutant enzymes were purified from a one liter 
culture grown under the conditions previously described. 
(Nielsen et al., 1990). Growth media containing 
secreted mutant enzyme was adjusted to pH 4.4 with 

30 concentrated acetic acid and then directly applied to 
the GYBS-Sepharose affinity column according to the 
procedure of Johansen et al. (1976). N51A, which did 
not bind effectively to GYBS-Sepharose, was subjected to 
diaf iltration against 10 mM NaH 2 P0 4 , pH 7.0, using a 

35 Pellicon system (Millipore) and then purified by ion 
exchange chromatography on a DEAE Fractogel 650 column 
(2.6 x 6 cm) . The column was washed until A 280 was below 
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0.01 and elution was accomplished with a linear salt 
gradient from 0 to 0.5 M NaCl in 10 mM NaH 2 P0 4 , pH 7.0. 
The elute was concentrated -using an Amicon cell and 
applied to a Sephacryl-S300 column (1 cm x 100 cm) 
5 equilibrated with 50 mM NaH 2 P0 4/ pH 7.0. Fractions with 
constant specific activity were pooled, concentrated and 
dialyzed against water. All enzyme preparations were 
stored frozen in water at -18°C. 

The purity of the mutant enzymes was 
10 ascertained by SDS-PAGE on 12.5% homogeneous gels using 
the PhastSystem from Pharmacia, The concentration of 
CPD-Y mutants was determined spectrophotometrically 
using A 2e0 (1 mg/ml = 1.48) (Johansen et al . , (1976) 
cited supra ) . 

15 Once prepared, the mutant or customized 

enzymes can be evaluated for a change in the 
transpeptidation reaction. 

EXAMPLE 2 

20 Carboxypeptidase Y Mutants With 

Improved Characteristics in Transacylation Reactions 
Using Amino Acids as Nucleophiles 

The binding site in carboxypeptidase Y (CPD-Y) 
25 for the negatively charged C- terminal carboxylate group 
of peptide substrates has been identified using site 
directed mutagenesis as described herein. While not 
meant to be a limitation of the invention, it is 
believed that the carboxylate group of the peptide 
30 substrate binds to the side chains of AsnSl and Glul45 
in the S/ binding pocket. Both side chains can act as 
hydrogen bond donors. The side chains of AsnSl and 
Glul4 5 appear to be oriented by hydrogen bonds with 
Glu65 and Trp49 which, therefore, have an indirect 
3 5 function in the binding of the carboxylate group of 
peptide substrates. 

Serine carboxypeptidases also catalyze the 
hydrolysis of peptide esters and this activity increases 
with pH and remains constant in the pH range 7 to 9.5. 
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Thus, at basic pH, the esterase activity is high and the 
peptidase activity is low. These unique properties, 
combined with an ability to catalyze transacylation 
reactions with amino acids or amino acid derivatives as 
5 nucleophiles, suggest that serine carboxypeptidases can 
be useful in peptide synthesis. However, many 
substrates and nucleophiles cannot be catalyzed by known 
enzymes. With amino acid methyl esters as nucleophiles, 
medium yields (40-80%) are obtained but this method is 

10 complicated by the risk of further elongation of the 

product (oligomerization) . It would be valuable to use 
amino acids rather than amino acid amides as 
nucleophiles . 

It has been demonstrated that mutant 

15 carboxypeptidase enzymes can bind to and catalyze 

peptide elongation with amino acids as nucleophiles in 
higher yields. With amino acids as nucleophiles, yields 
exceeding 60% are obtained in a few cases but yields of 
10-40% are much more common and H-Pro-OH, H-Glu-OH and 

20 H-Asp-OH are not accepted as nucleophiles. Thus, the 
yields obtained with amino acids as nucleophiles are 
rarely satisfactory. The low yields with amino acids as 
nucleophiles are not due to degradation of the product 
since the reaction is carried out at basic pH where the 

25 peptidase activity is very low (see above) , thus, 
securing accumulation of the peptide product in the 
reaction mixture. 

Mutants of carboxypeptidase Y were examined 
for the capacity to transacylate certain substrates 

30 using amino acids as nucleophiles. Some amino acid 
substitutions in the active site of mutant 
carboxypeptidase Y enzymes were also made knowing that 
they were- not likely to improve the yields of 
transacylation reactions, but rather to investigate the 

35 mechanism of action of the binding and catalysis with 
certain types of nucleophiles. 
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CPD-Y was obtained from Carlbiotech, 
Copenhagen, Denmark. Amino acids and buffers were from . 
Bachem, Switzerland or Sigma, USA. The mutants E65A, 
E65Q, E145A, E145Q, E61A+E145A, E65Q+E145Q, N51S, N51Q, 
5 N51A, and N51A+N145A were prepared as described in 

Example 1. The purity of the enzymes was ascertained by 
SDS polyacrylamide gel electrophoresis. 

Aminolysis reactions were carried out in the 
following way. The nucleophile was dissolved in 50 mM 

10 Hepes, 5 mM EDTA and pH was adjusted to 7.5. Five (5) 

/il of substrate (8 mM FA-Ala-OBzl in methanol) was added 
to 190 pi nucleophile solution followed by 5 /il enzyme, 
resulting in a substrate concentration of 0.2 mM. For 
reactions carried out at pH 9.5, Hepes was replaced with 

15 Ches. During the reaction, 20 /il aliquots were removed 
from the reaction mixture and added to 50 /il 1% 
trif luoroacetic acid to quench the reaction. The 
reactant composition was determined by HPLC using a 
Waters HPLC equipped with a C-18 Waters Novapac 4/z 

20 reverse phase column and various gradients of 
acetonitrile in 0.1% trif luoroacetic acid. The 
separation was monitored at 3 02 nm allowing the direct 
quantification of the products from the integrated peak 
areas. The composition of the reaction mixture was 

25 determined at least twice during the reaction, the first 
time when 20-50% (preferably 35%) of the ester substrate 
had been consumed in the reaction and the second time 
when 50-90% (preferably 80%) of the substrate had been 
consumed. The products were collected and identified by 

30 amino acid analysis after acid hydrolysis using a 
Pharmacia Alpha Plus analyzer. 

Further identification was obtained by co- 
chromatography of authentic standard compounds. The 
fraction of aminolysis (fa) was expressed as the ratio 

3 5 between the formed aminolysis product and the sum of all 
products being formed, i.e., unconsumed substrate was 
disregarded in the calculations. The K NUpp) , representing 
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the nucleophile concentration at which fa is half the 
maximum value (a measure for the dissociation constant 
of the nucleophile) , and fa^ (the highest possible fa 
obtained at saturation of the enzyme with nucleophile) 
5 were determined by fitting the values of fa obtained at 
a minimum of seven concentrations of nucleophile to a 
saturation. The value of fa obtained at the highest 
possible nucleophile concentration is designated f a aac . 
The ability of CPD-Y to catalyze 

10 transacylation reactions with amino acids or amino acid 
derivatives acting as nucleophiles in competition with 
water can be studied. A study of the beneficial effects 
of structural alterations within the S 1 ' binding site on 
the productive binding of amino acids to effect 

15 aminolysis is preferably performed with an ester 

substrate. Transacylation reactions should preferably 
be performed at slightly basic pH to maximize the 
esterase activity and minimize the peptidase activity. 
When amino acids are used as nucleophiles the product 

2 0 peptide is very slowly degraded by the enzyme and, as a 
consequence, it accumulates in the reaction mixture, 

FA-Ala-OBzl is hydrolyzed at very high k^/J^ 
by CPD-Y and the prepared mutant enzymes allowing the 
use of low concentrations of enzyme (0.5 /ig/ml) . The 

2 5 fact that the peptide products are hydrolyzed at much 
lower k c , t /K„ prevents degradation of the aminolysis 
product. At pH 7.5, with H-Val-OH as added nucleophile, 
two products were formed: FA- Ala -OH (hydrolysis) and 
FA-Ala-Val-OH (aminolysis) . At each concentration of 

30 nucleophile, the fraction undergoing aminolysis reaction 
was constant with time and independent of the 
concentration of substrate remaining in the reaction 
mixture. This indicates that the product FA-Ala-Val-OH 
is not hydrolyzed by the enzyme under the employed 

35 reaction conditions and, thus, the ratio of the two 

products reflect the relative rates of the two competing 
reactions. With increasing concentrations of H-Val-OH, 



WO 95/20039 



PCT/US95/06682 



44 

fa increased but it did not exceed 0.32. The 
correlation between the fraction of aminolysis and 
concentration of H-Val-OH was unaffected by the presence 
of 0.5 M NaCl in the reaction medium. Hence, the ratio 
5 of rates of the hydrolysis and aminolysis reaction is 
independent of the ionic strength of the reaction 
medium. 

The observation that CPD-Y becomes saturated 
with H-Val-OH is consistent with the previous 

10 demonstrations that nucleophiles bind to the acyl -enzyme 
intermediate of serine carboxypeptidases prior to the 
deacylation reaction. The correlation between fa and 
the concentration of nucleophile is consistent with K NUpp} 
and fa^ values of 13 mM and 0.32, respectively. The 

15 reaction was also carried out at pH 9.5 and the values 
for Ka^pp, and fa MX were 19 mM and 0.31, respectively, 
suggesting that the ratio of the two forms of the 
nucleophile (amino/ammonium) has little influence on the 
synthesis parameters. 

20 The influence of the side chain of the amino 

acid nucleophile on fa^ and K^pp, was investigated. 
K^pp, decreased in the following order: 

H-Gly-OH > H-Ser-OH > H-Lys-OH > H-Gln-OH > 
25 H-His-OH > H-Ala-OH > H-Val-OH > H-Leu-OH > 

H-Phe-OH. 

(See Table I) . With H-Pro-PH and H-Glu-OH, no 
aminolysis product was obtained. Thus, hydrophilic 
amino acids bind much less efficiently than hydrophobic 

3 0 ones. The highest fa max values were obtained with the 
amino acids that bind the least efficient: with 
H-Gly-OH, H-Ala-OH, H-Ser-OH, and H-Arg-OH, the fa^ 
values exceeded 0.8, whereas with the hydrophobic and 
efficiently binding H-Val-OH, H-Leu-OH and H-Phe-OH, 

35 values below 0.4 were obtained. 



WO 95/20039 



PCT/US95/06682 



45 
TABLE I 

CPD-Y Catalyzed Aminolysis of an Ester Substrate 
Using Amino Acids as Nucleophiles 

The Influence of Amino Acid Structure on 
the Parameters for the Aminolysis Reaction 



Nucleophile fa aat fa,^ K^app) 



H-Gly-OH 


0. 


60 


0 


.92 


± 


0. 


01 


1500 


± 50 


H-Ser-OH 


0. 


78 


0 


.87 


± 


0 . 


01 


310 


± 20 


H-Lys-OH 


0. 


79 


0 


.81 


± 


0 . 


03 


190 


± 30 


H-His-OH 


0, 


20 


0 


.27 


+ 


0 . 


01 


87 


± 9 


H-Gln-OH 


0. 


64 


0 


.90 


± 


0. 


02 


150 


± 8 


H-Glu-OH 


0 






n. 


d. 






n 


.d. 


H-Ala-OH 


0. 


85 


0 


.91 


+ 


0. 


02 


83 


± 7 


H-Val-OH 


0. 


30 


0 


.32 


+ 


0 . 


01 


13 


± 1 


H-Leu-OH 


0. 


32 


0 


.35 


± 


0. 


01 


10 


± 1 


H-Phe-OH 


0. 


14 


0 


.15 


± 


0 . 


01 


9.5 


± 1 


H-Pro-OH 


0 






n. 


d. 






n.d. 



All reactions were performed at pH 7.5 as described herein. The 
values for K^pp, are in mM. The concentrations at which fa„ e was 
obtained were : 

H-Gly-OH m 2.9 M, H-Ser-OH « 3.2 M, H-Lys-OH « 2.7 M, 
H-His-OH = 0.25 M, H-Gln-OH - 0.37 M, H-Glu-OH s 1.1 M, 
H-Ala-OH = 1.4 M, H-Val-OH « 0.48 M f H- Leu- OH » 0.14 M, 
H-phe-OH - 0.14 M and H-Pro-OH = N.D. 



With amino acid amides as nucleophiles, fa MX 
consistently exceeds 0.85 and is essentially independent 
of the hydrophobicity of the side chain. 

While not meant to be a limitation of the 
5 invention, the low values observed with some amino acids 
might be explained by the interaction between enzyme and 
the cr-carboxylate group of the amino acid when it is 
combined with tight binding of its side chain to the 



WO 95/20039 



PCTAJS95/06682 



46 

enzyme . The interaction between enzyme and 
a-carboxylate group of some amino acid nucleophiles 
apparently had an adverse effect on fa wx suggesting that 
alteration of this interaction could have a beneficial 
5 effect. Amino acid nucleophiles could occupy a position 
similar to that of the P x ' amino acid residue of peptide 
substrates. If this is the case, then the amino acids 
in CPD-Y involved in the binding of the C-terminal 
carboxylate group of peptide substrates would also be 

10 important for the binding of the a-carboxylate group of 
amino acid nucleophiles. The binding of the C-terminal 
carboxylate group of peptide substrates is dependent on 
hydrogen bonds from the side chain of AsnSl and Glul45, 
the latter with the carboxylic acid group in its 

15 protonated form, and the optimal position of these two 
side chains is secured by hydrogen bonds from Glu65 and 
Trp49. However, this is the situation at low pH where 
the enzyme efficiently binds peptide substrates: at pH 
7.5 where the aminolysis reactions were carried out, 

20 Glul45 is in its deprotonted negatively charged form 
which cannot participate in the binding of the 
a-carboxylate group. In fact, this negative charge may 
substantially influence the binding mode of the amino 
acid nucleophile. Hence, the transacylation reactions 

25 might be influenced by mutational replacements of AsnSl 
and Glul45 and possibly also, due to indirect effects, 
by replacement of Glu65. 

CPD-Y mutants with replacements at positions 
51, 65 and 145 were investigated for their ability to 

30 catalyze transacylation reactions using H-Val-OH and 
H-Leu-OH as nucleophiles (Table II) . 
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TABLE II 

Carboxypeptidase Mutants with 
Improved Characteristics of Acylation 



H-Val-OH H-Leu-OH 

Mutant fa^ K. {App) fa^ 



Wild-type 0.32 ± 0.01 13 ± 1 0.35 ± 0.01 10 ± 0. 

N51A+E145A 0* n.d. 0* n.d. 

N51A 0.02* n.d. 0* n.d. 

N51S 0.05* n.d. 0.03 a n.d. 

N51Q 0.97 ± 0.02 61 ± 3 0.96 ± 0.01 42+1 

E145A 0.48 ± 0.01 79 ± 6 0.30 ± 0.01 40+2 

E145Q 0.68 ± 0.02 41 ± 4 0.66 + 0.06 29+8 

E65A 0.91 ± 0.01 69 ± 2 0.96 ± 0.01 42+1 

E65Q 0.91 ± 0.02 75 ± 4 0.97 ± 0.01 56 ± 1 

E65A+E145A > 0.8 > 500 > 0.8 > 0.8 

E65A+E145Q 1.00 ± 0.03 240 ±15 > 0.8 > 0.89 



All reactions were performed at pH 7.5 as described herein. 
The values for K,,,.,^, are in mM. 
* The low values prevented the demonstration of saturation of the 
enzyme and consequently they must be regarded as f a« at values . 



The complete removal of the hydrogen bond donating 
capacity of the side chains at positions 51 and 145 by 
incorporation of Ala at both positions, i.e. N51A+E145A, 
abolished the capacity of the enzyme to accept amino 
5 acids as nucleophiles . The same result was obtained 
with the single mutant N51A suggesting that AsnSl is 
important for the binding of amino acid nucleophiles 
prior to the attack on the acyl-enzyme intermediate. 
However, with the other single mutant E145A, significant 
10 aminolysis was obtained and this is consistent with the 
notion that Glul45 at pH 7.5 plays a minor role in the 
binding of amino acid nucleophiles. Nevertheless, with 
H-Val-OH and H-Leu-OH as nucleophiles, K N(app) was 4-5 
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times higher using E145A as compared with the wild- type 
enzyme . 

It is common to all the positions 65 and 145 
mutants, i.e. E65A, E65Q, E145A, E145Q, E65A+E145A and 
5 E65Q+E145Q, that K,,,.^, with H-Val-OH and H-Leu-OH 

increased by a factor of 5 to 10. Simultaneously, an 
increase in fa^ was observed except with E145A. With 
E65A and E65Q, the fa^, values exceeded 0.90 and, as a 
result, it is possible with these enzymes to obtain much 

10 higher yields when hydrophobic amino acids are used as 
nucleophiles . Although not meant to be a limitation of 
the invention, Glu65 and Glul45 do not appear to be 
directly involved in the binding of the or-carboxylate 
group of amino acid nucleophiles at the pH where the 

15 reaction is carried out, they seem to exert an indirect 
influence such that their replacement affect the binding 
mode, as indicated by the elevated fa^ values. When 
amino acid amides are used as nucleophiles, the presence 
of the negatively charged Glul45 does not appear to 

20 exert a negative effect since the fa MX values 
consistently are high. 

Asn51 was replaced with other amino acid 
residues and these enzymes were tested. Replacing AsnSl 
with Ser or Gin affected fa^ in different directions. 

25 With N51S and H-Val-OH and H-Leu-OH as nucleophiles, 
f*ma* was 0.05 and 0.03, respectively (Table II). With 
N51Q, they were 0.97 and 0.96, respectively, and with 
the wild-type enzyme, fa^ was 0.35 with both 
nucleophiles . 

30 With FA-Phe-OMe as substrate and H-Val-OH as 

nucleophile, the same influence of the group at position 
51 on fa^ was observed (data not shown) . The kinetic 
parameters for the hydrolysis of FA-Phe-Val-OH by these 
mutants are: k cac = 1500 min* 1 , K„ = 0.085 mM, 

35 k CAC /K M = 18000 min' 1 mM* 1 (N51Q) ; k cac » 8100 min' 1 , = 

0.17 mM, kcae/Kn = 48000 min* 1 mM' 1 (N51S) as compared with 
k cat = 6500 min* 1 , K„ = 0.047 mM, k CttC /K„ = 140000 min* 1 mM* 1 
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(wild- type) . Normally, in serine protease catalyzed 
peptide bond hydrolysis, the acylation step is rate- 
limiting. However, this is not the case in CPD-Y 
catalyzed hydrolysis of FA-Phe-X-OH substrates (X = 
5 amino acid residue) since the k cat values sh6w only 

little dependence on the structure of X. Thus, the rate 
of the deacylation step exerts an influence on K^. The 
k cac values for the hydrolysis of FA-Phe-Val-OH increase 
in the order N51Q < wild-type < N51S and this correlates 

10 inversely with fa^ values with H-Val-OH as nucleophile 
which decrease in the order N51Q > wild type > N51S. 

The results in Table II show that, among the 
mutants tested, the highest fa^ values were obtained 
with N51Q and, as a consequence, this enzyme was deemed 

15 the most suitable supplement to the wild-type enzyme as 
a catalyst in this particular type of peptide synthesis. 
This was further investigated by determination of the 
K^pp, and fa^ values for a number of different amino 
acid nucleophiles using 51Q {Table III) and the results 

20 may be compared to those obtained with the wild-type 
enzyme. For H-His-OH and the hydrophobic amino acids, 
the fa lflax values were significantly higher with N51Q as 
compared with the wild-type enzyme. However, with some 
amino acids, the AsnSl -» Gin had no or only small effect 

25 on fa MJt , i.e., those where fa,^ with the wild-type enzyme 
was zero or exceeded 0.9. With all nucleophiles, K^^p, 
was significantly higher with N51Q as compared with the 
wild-type enzyme and this had the consequence that fa^ 
with some amino acids could not be reached due to 

30 limited solubility of the nucleophile. The value of fa 
obtained at the highest possible concentration of 
nucleophile, i.e. fa flac , has been listed in Table III and 
this value represents the highest possible yield that 
can be obtained in aqueous solution at pH 7.5. A 

35 comparison of the fa sat values obtained with the wild- 
type enzyme leads to the conclusion that N51Q is 
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superior to the wild- type enzyme with the hydrophobic 
amino acids as nucleophiles . 
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EXAMPLE 3 

Alteration of Leaving Group Preference for 
Mutant Carboxypeptidase Y 
Catalyzed Transpeptidation Reactions 

5 

With ester substrates, serine 
carboxypeptidases exhibit no dependence on the nature of 
the leaving group. However, with peptide siibstrates 
where an amino acid acts as leaving group, this is not 

10 always the case. With carboxypeptidase Y, the most 

commonly employed serine carboxypeptidase, high yield of 
transpeptidation is only achieved when the leaving group 
is a hydrophilic amino acid. However, since 
carboxypeptidase Y in hydrolysis reactions exhibits a 

15 preference for hydrophobic amino acid leaving groups (P x ' 
amino acids) , it would be beneficial, due to higher rate 
and specificity of the reaction, if such leaving groups 
were permissible in transpeptidation reactions as well. 
This would also permit modification of peptides and 

20 proteins, as isolated from natural sources, which 

presently are excluded due to hydrophobic C- terminal 
amino acid residues. The low yields due to the 
described leaving group dependence has prevented further 
development of a method for incorporation of labelled 

25 amino acid residues into peptides and proteins with the 
purpose of facilitating amino acid sequence 
determinations . 

Mutants of carboxypeptidase Y were examined 
for the capacity to enhance yields of transacylation 

3 0 with substrates having leaving groups that are not 

hydrophilic. Some amino acid substitutions made in the 
active site of mutant carboxypeptidase Y enzymes were 
also made knowing they were not likely to improve the 
yields of transacylation reactions, but rather to 
35 investigate the mechanism of action of the leaving group 
dependence . 

Carboxypeptidase Y was obtained from 
Carlbiotech, Copenhagen, Denmark. All amino 
acids/amino acid derivatives were purchased from Bachem, 

4 0 Switzerland. The mutations in the structural gene for 
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carboxypeptidase Y were carried out as described in 
Example I. The mutants N51S, N51Q, E65A, and E145A were 
isolated as described in Example I and their purity was 
ascertained by SDS polyacrylamide gel electrophoresis 
5 performed on a Pharmacia Phast-system. The substrates 
FA- Ala -Gin -OH, FA-Ala-Arg-OH, FA-Ala-Lys-OH, 
Hippuryl-L-Phe-OH, and Hippuryl-L-/3-Penyllactic acid 
were purchased from Bachem, Switzerland. The 
preparation of the following substrates were carried as 

10 previously described: FA-Ala-OBzl, FA-Ala-Ala-OH, 
FA-Ala-Val-OH, FA- Ala-Leu -OH, and FA-Ala-Phe-OH . 
Breddam et al., Carlsberg Res. Comm. , 49:535 (1984). 

Aminolysis reactions were carried out in the 
following way. The nucleophile was dissolved in 50 mM 

15 HEPES, 5 mM EDTA and pH was adjusted to 7.5, 5 f-il 
substrate (8 mM FA-Ala-OBzl or FA-Ala~Xaa-OH in 
methanol) was added to 190 jil nucleophile solution 
followed by 5 ^1 enzyme diluted in buffer to the 
appropriate concentration, resulting in a substrate 

20 concentration of 0.2 mM. During the reaction, 20 /il 
aliguots were removed from the reaction mixture and 
added to 50 pi 1% trif luoroacetic acid to quench the 
reaction. The reactant composition was determined by 
HPLC using a Waters HPLC equipped with a C-18 Waters 

2 5 Novapac 4ji reverse phase column and various gradients of 

acetonitrile in 0.1% trif luoroacetic acid. The 
separation was monitored at 302 nm allowing the direct 
quantification of the products from the integrated peak 
areas. The products were collected and identified by 

3 0 amino acid analysis after acid hydrolysis (Pharmacia 

Alpha Plus analyzer) and by co- chromatography of 
authentic standards. The fraction of aminolysis (fa) 
was expressed as the ratio between the formed aminolysis 
product and the sum of all products being formed, i.e. 
3 5 unconsumed substrate was disregarded in the 

calculations. The K^.pp (a measure for the dissociation 
constant of the nucleophile) and fa^ (the highest 
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possible fa obtained at saturation with nucleophile were 
determined by fitting the obtained fa values to the 
equation fa = f a raax / { 1+K^pp/N) . 

The enzymatic hydrolysis of FA-Ala-OBzl and 
5 the peptide substrates was followed on a Perkin Elmer X9 
spectrophotometer. The substrates were dissolved in 
methanol at a concentration of 0.5 to 8 mM. A total of 
25 /il of substrate solution was added to 955 /xl of 50 mM 
HEPES, 5 mM EDTA, pH 7.5, to give initial substrate 

10 concentrations in the cuvette, s 0 . of 1.25 to 200 /iM. 

The cleavage of the substrate was followed with time by 
monitoring the decrease in absorbance at 337 nm and from 
this the initial rates of hydrolysis v 0 . was determined. 
For FA-Ala-OBzl, the k cat /K„ value was determined by 

15 fitting to the following form of the Michaelis-Menten 
equation: v 0 = e 0 (k^/Kj / (l/K* + l/s 0 ) . For the 
peptide substrates, the k^Kn values were determined by 
fitting to the equation v 0 = (k cat /K„) e o s 0 . which is valid 
when Sq^Km. All fits of the parameters k cac /K„, fa^ and 

20 K N a PP were performed using the Grafit program 

(Leatherbarrow, R.J., Grafit version 3.0, Erithacus 
Software Ltd., Staines, U.K., 1990). 

While not meant to be a limitation of the 
invention, it is believed that in CPD-Y catalyzed 

25 hydrolysis reactions, a substrate acylates the essential 
serine residue which subsequently is deacylated by 
water, hence completing the hydrolysis reaction. When 
an amine nucleophile, e.g., an amino acid or amino acid 
amide, is added to the reaction mixture, the acyl-enzyme 

3 0 will be partitioned between water and the added amino 

component, in the latter case forming a new peptide bond 

{ transpeptidation) . 

The saturation curves can be described by 

equation (1) . 

3 5 fa^ 

fa = 

KN.app (1) 
1 + 

N 
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The upper limit of the fa, termed f^, is 
reached when N^K^pp. fa^ can reach a maximum value of 
1. This is, however, rarely obtained in practice. The 
concentration at which fa^/2 is reached, termed K N app , 
5 describes the affinity of the nucleophile for the enzyme 
and the dissociation of the aminolysis product. 

Peptide esters may function as substrates and, 
in this case, reaction with the amine component causes 
elongation of the peptide. The ratio of the hydrolysis 

10 to aminolysis reaction is not influenced by the nature 
of the alcohol leaving group, but that there is a 
pronounced effect of the nucleophile on fa tnax . Hence, 
fa tnax values ranging from 1.00 with H-Gly-NH 2 to 0.15 with 
H-Phe-OH as nucleophile are observed. 

15 When a peptide functions as substrate, in 

which case the C-terminal amino acid residue is 
exchanged with the added amino component, the 
hydrophobicity of the amino acid leaving group exerts a 
pronounced influence on the hydrolysis/aminolysis ratio. 

20 With H-Val-NH 2 or H-Gly-NH 2 a nucleophile and FA-Ala- 
XAA-OH (XAA = H-Arg-OH, H-Lys-OH, H-Gln-OH, H-Ala-OH, 
H-Val-OH, H-Leu-OH, and H-Phe-OH) as substrates, the 
observed fa^ values reached 1.00, 0.99, 0.93, 0.94, 
0.22, 0.16, and 0.056, respectively (see Table IV). The 

25 differences in fa max suggests that the rate of 
dissociation of the amino acid leaving group is 
comparable to that of hydrolysis of the acyl -enzyme with 
the leaving group bound. Furthermore, they coincide 
with the observation that the fa max values obtained with 

30 the ester substrate and H-Arg-OH, H-Lys-OH, H-Gln-OH, 
H-Ala-OH, H-Val-OH, H-Leu-OH, and H-Phe-OH as 
nucleophiles are 0.87, 0.81, 0.90, 0.91, 0.32, 0.35, and 
0.15, respectively, and thus similar to the fa tnax values 
obtained when these amino acids act as leaving groups 

3 5 (see Table IV) . 
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TABLE IV 

Influence of the Amino Acid Nucleophile/Leaving Group 
On the fa^ Values Obtained with Wild-Type CPD-Y 



FA-Ala-OBzl + • FA-Ala-Xaa-OH 

H-XA-OH H-Xaa-OH H- Val/Gly-NH 2 



H-Gln-OH 0.90+0.03 0 . 93 ± 0 . 01* 

H-Lys-OH 0.81 + 0.03 0.99 ± 0.01* 

H-Arg-OH 0.87 + 0.01 1.00 ± 0 . 02 b 

H-Ala-OH 0.91+0.02 0.94 + 0.01 b 

H-Val-OH 0.32 + 0.01 0.22 + 0.0l b 

H-Leu-OH 0.35+0.01 0.16 + 0.01 b 

H-Phe-OH 0.15 + 0.01 0.06 + 0.01 b 



* H-Gly-NH 3 was used as nucleophile and 

b H-Val-NH 2 was used as nucleophile. 

Using FA-A-OBzl as substrate, the observed fa„. was 1.00+0.01 with 
H-Gly-NH 2 and 0.98+0.01 with H-Val-NH,. 



It has previously been suggested that low 
yields in aminolysis reactions were due to degradation 
of the products, a theory based on the fact that any 
product of an aminolysis reaction will be a substrate 
5 for CPD-Y. However, in the reactions studied here with 
FA-Ala-OBzl as substrate, the k. cat /K„ for the hydrolysis 
of the products, under the conditions used in the 
aminolysis reactions, are 50-100 fold lower than that of 
the substrate (data not shown) . Consequently, no 

10 degradation of the aminolysis products is detected with 
FA-Ala-OBzl as substrate and FA- Ala-Xaa-OH (Xaa = 
Ala-OH, Val-OH, Leu-OH, Val-NH 2 ) as product. 

In order to be able to alter yields obtainable 
with a specific amino acid acting as leaving 

15 group/nucleophile, it is helpful to know what 

contributes to low fa^ values. While not meant to 
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limit the invention, it is believed that the wide range 
of results with amino acids as leaving groups indicates 
that the magnitude of the rate constants which enter the 
expression for fa^ are associated with the nature of 
5 the amino acid side-chain. It appears that the more 
hydrophobic the leaving group, the lower the fa^,. A 
significant increase in fa^ can be achieved by 
structural alterations within the binding pocket for the 
side-chain of the P x ' amino acid residue. Alternatively, 

10 modification of the interaction between the 

a-carboxylate group of the leaving group or nucleophile 
and the binding site for the C-terminal carboxylate 
group (AsnSl, Glu65 and Glul45) will influence fa^. 
Alterations within this region result in CPD-Y 

15 derivatives with changed fa max values in transacylation 
reactions with FA-Ala-OBzl as substrate and various 
amino acid nucleophiles . 

Aminolysis reactions were carried out with the 
substrates Hippuryl-L-Phe-OH (peptide bond, the 

20 N- terminal of the leaving group is a H 2 N- group) and 
Hippuryl-L-0-Phenyllactic acid {ester bond, the 
N-terminal of the leaving group is a HO- group) , using 
H-Gly-NH 2 as nucleophile. These two reactions gave us a 
possibility to evaluate whether the nature of the Phe 

25 side -chain carboxyl- terminal or the amino group 

determines the low fa^ value observed with Phe. The 
aminolytic parameters with phenylalanine or L-0- 
Phenyllactic acid, respectively, as the leaving group 
and H-Gly-NH 2 as nucleophile were determined: fa MX were 

30 found to be 0.33±0.0l and 0.96+0.01, respectively, and 
K Ntapp to be 0.63±0.06 and 0.47±0.05, respectively. Thus, 
these reactions suggest that the nature and positioning 
of the a-amino group contributes to the low yield 
obtained with some amino acids. 

35 While not meant to be a limitation of the 

invention, it appears that the conformation in which an 
amino acid binds within the S 1 ' site may facilitate or 
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restrain the access of water to the acyl component 
covalently attached to the essential serine residue 
(Serl46) due to the positioning of the amino group of 
the leaving group/nucleophile . It should, thus, be 
5 possible to achieve increased fa^ values if the binding 
mode of a specific amino acid within the S,' binding site 
is altered to prevent nucleophilic attack of water on 
the acyl-enzyme. 

The reaction FA-Ala-Xaa OH + H-Val-NH 2 - 

10 FA-Ala-Val-NH 2 + H-Xaa-OH was studied with these mutants. 
With the mutant N51Q, the fa^ values were 0.98, 0.91, 
and 0.8 0 with Xaa = Ala, Val, Leu, respectively. . 
Similar results were obtained in the reaction FA- Ala - 
OBzl + H-Xaa-OH (Xaa = Ala, Val, Leu, and Phe) 

15 FA-Ala-Xaa-OH + HOBzl (see Table V) , where the fa^ 

values with Xaa « Val and Leu are 4-5 fold higher than 
those obtained with the wild-type enzyme. The 
observation that N51Q exhibits increased fa^ values in 
trajispeptidation reactions is probably due to changes in 

20 the binding of amino acids within S/. The mutation 
results in increased reaction with amine relative to 
water. This result would be consistent with a shorter 
distance between the acyl-enzyme and the a-amino group 
of the nucleophile/leaving group. 



25 
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TABLE V 



Influence of the Amino Acid Nucleophile on 
fa^ and Values Using FA-Ala-OBzl as Substrate 

and CPD-Y and Various Mutants 



Enzyme Nucleophile fa,*, Km.-pp K »i.*pp/f a m« 

(mM) (mM) 



Wild type H-Ala-OH 0.91+0.02 83 ± 7 91 

H-Val-OH 0.32 ± 0.01 13 ± 1 43 

H-Leu-OH 0.35 ± 0.01 10 ± 1 2 9 

H-Phe-OH 0.15 ± 0.01 10 ± 1 



N51S 



H-Ala-OH 
H-Val-OH 
H-Leu-OH 
H-Phe-OH 



0.56 



±0.05 
n.d. b 
n.d. b 
n.d. b 



140 + 4 

n.d. b 
n.d. b 
n.d. b 



250 



n.d. 
n.d. 
n.d. 



N51Q 



H-Ala-OH 
H-Val-OH 
H-Leu-OH 
H-Phe-OH 



0.99 t 0-01 
0.97 ± 0.02 
0.96 ± 0.01 
0.89 ± 0.03 



250 ± 15 
61 ± 2 
42 ± 1 
29 ± 1 



250 
63 
44 
33 



E6 5A 



H-Ala-OH 
H-Val-OH 
H-Leu-OH 
H-Phe-OH 



0.98 ± 0.02 
0.91 ± 0.01 
0.96 ± 0.01 
0.89 ± 0.03 



280 ± 20 

69 ± 2 

45 ± 2 

31 + 1 



290 
76 
47 
35 



E145A H-Ala-OH n.d. c n.d. c 7100 

H-Val-OH 0.48+0.01 79 i 6 160 

H-Leu-OH 0.30 + 0.01 40+2 130 

H-Phe-OH 0.31 ± 0.01 21 ± 1 68 



n.d. Not determined due to: 

b low fa values in the examined concentration range or 
c Ktf.mpp values much larger than N. 



In contrast to this, introduction of a serine 
at position 51 results in fa^, values that are reduced 
to approximately 50% of the wild-type value (Tables V 
and VI) . 
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TABLE VI 



Influence of the Substrate Leaving Group on 
fa »« 2111(1 Kji.w Values With H-Val-NH 2 as Nucleophile 
and CPD-Y and Various Mutants 



Enzyme Nucleophile £***x ^."pp 

(mM) (mM) 



Wild type FA-Ala-OBzl 0.98 ±0.01 1.7+0.1 1.7 

FA-Ala-Ala-OH 0.94 ±0.01 1.8 ± 0.1 1.9 

FA-Ala-Val-OH 0.22 ±0.01 1.5 ± 0.1 6.6 

FA -Ala -Leu -OH 0.16 ±0.01 1.6+0.1 9.9 

FA-Ala-Phe-OH 0.06 ±0.01 1.1+0.1 19.6 



N51S FA-Ala-OBzl 0.97 ±0.01 1.6±0.1 1.7 

FA- Ala -Ala -OH 0.88 ± 0.01 1.9 + 0.1 2.2 

FA-Ala-Val-OH 0.13 + 0.01 1.8 + 0.1 13.9 

FA- Ala -Leu -OH n.d.' n.d.' n.d. 

FA - Ala - Phe - OH n.d.' n.d.* n.d. 



N51Q FA-Ala-OBzl 0.96+0.01 2.1+0.4 2.1 

FA-Ala-Ala-OH 0.98 ±0.01 3.1 ± 0.1 3.2 

FA-Ala-Val-OH 0.91 ±0.01 2.4 ± 0.1 2.6 

FA -Ala -Leu -OH 0.80 ± 0.01 2.4 ± 0.1 3.0 

FA- Ala - Phe -OH 0.77+0.01 3.1+0.1 ?? 



E65A FA-Ala-OBzl 0.97 ± 0.01 4.2 ± 0.1 4.4 

FA- Ala -Ala -OH 0.96 ± 0.02 5.2 ± 0.5 5.5 

FA-Ala-Val-OH 0.67 + 0.01 5.7 + 0.3 8.5 

FA-Ala-Leu-OH 0.71 ±0.01 5.2 ± 0.1 7.4 

FA-Ala-Phe-OH 0.66 ±0.01 4.7+0.1 7.1 



E145A FA-Ala-OBzl 0.98+0.01 3.4±0.04 3.5 

FA-Ala-Ala-OH n.d. b n.d. b 

n.d. 

FA-Ala-Val-OH 0.80 ± 0.01 4.2 + 0.2 5.2 

FA-Ala-Leu-OH 0.65 ±0.01 4.1+0.1 6.3 

FA-Ala-Phe-OH 0.49 + 0.01 4.1 ± 0.1 8.4 



n.d. Not determined due to: 

* low fa values in the examined concentration range or 
b considerable degradation of the product. 
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This is consistent with the fact that no products are 
observed with FA-Ala-OBzl as substrate and valine, 
leucine or phenylalanine as nucleophile and with alanine 
fa^x wa s reduced from 0.91 to 0.56 (Table V) . Thus, 
5 with N51S the decreased fa^ values observed with 
FA-Ala-OBzl are also found with FA-Ala-Xaa-OH as 
substrate, hence repeating the pattern found with N51Q, 
i.e., that similar changes in fa^ are found with either 
leaving group. While not meant to be a limitation of 

10 the invention, this might be due to the serine in the 
mutant being shorter than the asparagine and subsequent 
steric changes in the active site that result in more 
rapid hydrolysis. The results from the aminolysis 
reactions are in good agreement with those obtained from 

15 steady state kinetics, which suggest N51 interacts with 
the C-terminal a-carboxylate group of peptide 
substrates . 

The characteristics of the mutant E61A are 
quite identical to those of N51Q. Hence, it exhibits an 

20 increase in fa^ in the reaction: FA-Ala-OBzl + H-Xaa-OH 
(Xaa = Val, Leu and Phe) -> FA-Ala-Xaa-OH + HOBzl to 
approximately 0.9 (see Table V) as well as in the 
reactions where these amino acids act as leaving groups. 
In the latter reactions, fa^ reaches 0.67, 0.71 and 

25 0.66 with -Val-OH, -Leu-OH and -Phe-OH, respectively. 
Again, we observe similar changes in fa^ with the same 
amino acid as leaving group/nucleophile, the same 
pattern as previously was found with the mutants on 
position N51 . 

30 With E145A, fa^ was increased from 0.22 to 

0.890 with valine as leaving group and from 0.16 to 0.65 
with leucine. But with valine as nucleophile, fa MX is 
only increased from 0.35 to 0.48, and with leucine 
reduced from 0.35 to 0.30. Thus, E145A does not show a 

35 similar increase in fa^ with the same amino acids as 

nucleophiles . In this way the characteristics of E145A 
deviate from those of N51S, N51Q and E65A. The fact 
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that with E145A the fa^ values in reactions with 
FA-Ala-OBzl and FA-Ala-Xaa-OH deviate is consistent with 
to E14 5 not being involved in the binding of the 
carboxyl group of an amino acid nucleophile. 
5 From these mutants it has been demonstrated 

that it is possible to alter the leaving group 
preference of CPD-Y, hence increasing its potential use 
in various transpeptidation reactions. 

10 EXAMPLE 4 

Alteration of Nucleophile Specificity 
By Mutant Carboxypeptidase 

The characterization of a series of 

15 mutationally altered derivatives of this 

carboxypeptidase Y (CPD-Y) has delineated the nature of 
the interaction between the C- terminal carboxylate group 
of the substrate and the enzyme. While not meant to be 
a limitation of the invention, it is believed that 

20 hydrogen bonds from the side chains of AsnSl and Glul45 
appear to be responsible for the binding of the 
C- terminal carboxylate group of peptide substrates. The 
peptidase activity of CPD-Y is optimal at acidic pH. 
CPD-Y also catalyzes the release of amino acid amides 

25 from peptide amides but this activity is optimal at 

basic pH. It is likely that at the basic pH range AsnSl 
interacts with the carbonyl oxygen of the C-terminal 
carboxyamide group while Glul45 in its deprotonated 
(carboxylate) form interacts with the -NH 2 group of the 

30 substrate. Glu65 is hydrogen bonded to AsnSl and Glu 
145 thereby orienting the two side chains involved in 
C-terminal recognition. When these amino acid residues 
are exchanged by site directed mutagenesis with amino 
acid residues without the capacity to function as 

3 5 hydrogen bond donors, the k c&z /K„ for the release of amino 
acids and amino acid amides from the C-terminus of 
peptides and peptide amides is drastically reduced. 
However, when other amino acid residues with such a 
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capacity are incorporated at these positions, much more 
activity is retained. 

In a number of cases, higher yields with 
nucleophiles containing a blocked a-carboxylate group, 
5 in particular amino acid amides, would be highly 

desirable. This problem can be approached by protein 
engineering. In particular, one could imagine that such 
changes might improve the synthetic capacity of the 
enzyme by (a) permitting the use of nucleophiles which 

10 are poorly accepted by the wild- type enzyme, (b) 

securing better binding of nucleophiles, (c) providing 
higher yields, and (d) reducing the rate of degradation 
of products. We here report the effects of such 
mutations on transacylation reactions with amino acid 

15 derivatives as nucleophiles. Some amino acid 

substitutions made in mutant proteases were made knowing 
that they were not likely to enhance yields of 
transacylation, but rather to study the mechanism of 
action of binding and catalysis with nucleophilies 

20 containing a blocked or-carboxylate group. 

Carboxypeptidase-Y was obtained from 
Carlbiotech, Copenhagen, Denmark. H-Val-OPr, H-Val-OBu, 
H-Val-NHCH 3 and H-Val-NHC 2 H S were from Peptech, Sydney, 
Australia. All other amino acids and amino acid 

25 derivatives were from Bachem, Switzerland. The mutants 
Asn 51 -* Gly, Cys and Val in the structural gene for 
CPD-Y and subsequent expression and purification of the 
mutants N51G and N51C were carried as described in 
Example 1. The purity of the enzymes was ascertained by 

30 SDS polyacrylamide gel electrophoresis. The preparation 
of the mutants N51A, N51D, N51T, N51Q, N51S, E145A and 
E14 5D has previously been described. The mutations 
AsnSl -* Cys, AsnSl -> Gly and AsnSl Val were 
constructed. However, N51V was not expressed and, thus, 

35 only N51C and N51G were isolated. FA-Ala-OBzl, 

FA-Phe-Ala-OH and FA-Phe-Leu-OH were synthesized by 
standard methods . 
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All enzymatic activities toward FA-substrates 
were determined spectrophotometrically at 329-337 nm 
using a Perkin Elmer lambda 7 or lambda 9 
spectrophotometer thermostated at 25°C. The hydrolysis 
5 was carried out in 0.05 M Mes # 1 mM EDTA, pH .6.5 for 
peptide substrates and in 0.05 M Hepes, 1 mM EDTA, 2.5% 
(v/v) , pH 7.5 for ester substrates, k cat and K. values 
were determined using the Enzfitter program. 

Among the previously described CPD-Y 

10 derivatives, mutationally altered at positions 51, 65 
and/or 145, a number were chosen to be investigated for 
their applicability in peptide synthesis. For the study 
of aminolysis reactions, the excellent CPD-Y substrate 
FA-Ala-OBzl was selected and initially the ability of 

15 all the CPD-Y derivatives to hydrolyze this substrate 
was investigated (data not shown) . With most mutants 
the hydrolysis of FA-Ala-OBzl was only slightly affected 
by replacement of AsnSl. In fact, with a number of 
mutants, elevated k^/Kn values were observed. With 

20 E65A, N51G and, in particular, E145Q, reduced k„ t /K„ were 
observed, possibly due to the effects of minor 
conformational changes. However, apart from this 
result, it appears that the interaction between enzyme 
and the -OBzl leaving group, in spite of its bulkiness, 

25 is not adversely affected by mutational replacements at 
positions 51, 65 and 145. Thus, provided that these 
enzymes may bind nucleophiles in a productive mode, they 
should be able to catalyze aminolysis reactions. 

The reactions with the valine amides were 

3 0 investigated. With H-Val-NH 2 , the fraction undergoing 
aminolysis reaction increased with increasing 
concentration of nucleophile until it reached 1.00. The 
results were analyzed according to a model which assumes 
that nucleophile binds to the acyl-enzyme intermediate 

3 5 prior to the deacylation reaction. The correlation 
between fa and the concentrations of nucleophile are 
consistent with K„ Up p, and values of 16 mM and 1.00, 
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respectively. Thus, the hydrolysis reaction is 
completely excluded by high concentrations of 
nucleophile. With the corresponding free amino acid, 
H-Val-OH, the values for K* im) and fa^, as determined 
5 under identical conditions, were 13 mM and .0.32, 

respectively. Thus, compared to H-Val-NH 2 , the binding 
of the free amino acid is much looser and, when it is 
bound, the binding mode evidently is much less favorable 
for the aminolysis reaction. The beneficial effect of 

10 blocking the carboxylate group is presumably due to 

elimination of the adverse effects of charge repulsion. 

The contribution of AsnSl and Glul45 to the 
productive binding of amino acid amide nucleophiles was 
investigated by determination of K^^p, and fa MX with 

15 H-Val-NH 2 using CPD-Y derivatives mutationally altered at 
positions 51 and 145 {see Table VII) . 
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TABLE VII 

Use of CPD-Y Mutants for Aminolysis of 
FA-Ala-OBzl using H-Val-NEL, and N-alkyl 
Derivatives Hereof as Nucleophiles 

Nucleophile 



Enzyme H-Val-NH 2 H-Val-NH-CH 3 



Wild-type 


(app) 


1 
0 


. 6 
. 97 


± o 
± o 


. 1 
.01 






0 


290 
.80 


± 50 
± 0.05 


N51G 


^NUpp) 


0 


.89 


12 

± o 


.02 


2 




0 


.76 


J D U 3 \J 

± 0.07 


N51A 




1 


. 00 


± o 


.01 


T 

X 






> 


> ±4 uu 

0.8 


N51S 




1 


. 00 


± o 


.01 


n 
\j , 


> X 


0 


.79 


*5 £ A j. ^ O 

± 0.05 


N51C 
100 


^NUppl 








J. 

X 










izuu ± 




fa-. 


0 


.95 


± o 


.01 






0 


.72 


±0.04 


N51D 




0 


.95 


11 

± o, 


± 

.01 


1 




0 


.70 


510 ± 40 
± 0.03 


N51Q 




0, 


,97 


1.5 

± o, 


.01 


0. 


1 








E145A 


^Ntapp) 


0 . 


. 98 


3.1 

± o. 


± 

.01 


0. 


1 


0. 


.97 


140 ± 13 
±0.03 


E145Q 


fa-. 


0 . 


, 08 


8.4 

± o. 


± 
01 


1. 


0 








E145D 


KfiUpp) 


0 . 


98 


2.3 

± o. 


I 
01 


0. 


1 


0. 


59 


580 ± 30 
±0.02 


E65A 


^N(app) 


2 . 
0 . 


9 

96 


± o. 

± 0. 


3 

01 












N51A+E145A 




0 . 


95 


100 

± o. 


± 
02 


7 




0. 


98 


640 ± 40 
± 0.03 


E65A+E145A 


Kniappi 


0 . 


76 


110 
± 0. 


± 
01 


3 




0. 


33 


47 + 4 
± 0.01 
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With N51A and N51G, the K N(app , values were increased 8-20 
fold and the fa^ values remained high. While not meant 
to be a limitation of the invention, it is believed 
these results demonstrate that the capacity of the group 
5 to donate a hydrogen bond is not required to properly 
orient the amino acid amide prior to the nucleophilic 
attack on the acyl -enzyme intermediate but it is 
apparently important for the binding of the nucleophile. 
These results are consistent with AsnSl functioning as 

10 hydrogen bond donor with the carbonyl oxygen of the 
carboxyamide group of the nucleophile as acceptor. 
AsnSl can be replaced with other hydrogen bond donors 
without impairing this wild-type enzyme. However, the 
AsnSl -* Cys mutation caused a significant increase in 

15 Ktfuppi ar *d this is consistent with hydrogen bonds 

involving Cys being rather weak. With N51D, the value 
for K NUpp , was also significantly elevated but this result 
is difficult to interpret since it is possible that 
AspSl may interact with -NH 2 group of the nucleophile. 

20 With E145A and E65A, the values for K^^,, were 

only doubled and the fa^ value remained unchanged. 
Thus, compared to AsnSl, Glul45 and Glu65 appear to be 
much less important for the interaction with amino acid 
amide nucleophiles . However, the possibility existed 

25 that the remaining glutamic acid, which is negatively 
charged at pH 7.5, could function in the binding of 
amino acid amides. To investigate this, the double 
mutant E65A-E14 5A was tested and it was found that K N(app) 
was drastically increased while fa^ was somewhat 

30 reduced. Thus, the absence of both glutamic acids is 
detrimental to the binding of amino acid amides but the 
presence of one of them is sufficient to secure tight 
binding. In the wild-type enzyme, Glul45 is negatively 
charged while Glu65 is uncharged and, accordingly, 

35 Glul45 is the one interacting with amino acid amides. 
The very low fa^ and significantly elevated K^pp, 
obtained with E145Q confirm the significance of Glul45 
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in the interaction with amino acid amides. Most likely, 
the C- terminus of peptide amides interact with the 
enzyme in an analogous way with the negatively charged 
Glul45 acting as a hydrogen bond acceptor. Since both 
5 Glu65 and Glul45 in the single mutants may function 'in 
this capacity there apparently may be some latitude 
concerning the length of the hydrogen bond. This is 
suggested by the fact that shifting the carboxylate 
group at position 145 one carbon atom away, i.e. E145D, 

10 has very small effects on both parameters. The high 
Ktuapp) obtained with the double mutant N51A+E145A and 
H-Val-NH 2 as nucleophile show that the effects of the 
AsnSl -* Ala and Glul4 5 -* Ala mutations are approximately 
additive but the fa max remained high. 

15 Valine N-methyl amide (H-Val-NH-CH 3 ) binds much 

less efficiently to CPD-Y than the unblocked H-Val-NH 2 . 
However, the fa^ is almost as high (0.80). 
Substitution of AsnSl for Gly, Ala, Ser, Asp or Gin and 
Glul45 for Ala, Asp or Gin only affected the values for 

20 fa^ and K NUpp) moderately, suggesting that neither Asn51 
nor Glul45 is involved in the interaction with H-Val-NH- 
CH 3 . The binding mode of this nucleophile, therefore, 
remains unclear. 



25 EXAMPLE 5 

Altering Substrate Preference of 
Carboxypeptidase Y by a Novel Strategy of Mutagenesis 

Selected targets of PRC1 were mutagenized 
30 randomly and subsequently screened for mutants 

expressing carboxypeptidase Y (CPD-Y) with increased 
activity toward poor P A substrates. From an alignment of 
the primary structures of CPD-Y and a wheat 
carboxypeptidase (CPD-W) (Breddam et al., Carlsbera Res . 
35 Commun . , 52:55-63; 65-71 and 297-311 (1987)) as well as 
the crystal structure of CPD-W (Liao et al., J. Biol. 
Chem. , 215:6528-6531 (1990), it was predicted which 
amino acid residues might constitute the surface of the 
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CPD-Y S x binding pocket. The corresponding codons of 
PRC1 were chosen as targets for saturation mutagenesis. 

The basic technique of the mutagenesis was in 
vitro DNA synthesis primed by mutagenic (degenerate) 
5 synthetic oligonucleotides using single -stranded 

phagemid DNA as template, followed by transfection of 
Escherichia coli . To facilitate efficient screening of 
a large population of mutant transf ormants, a new and 
generally applicable mutagenesis strategy was developed 

10 which eliminates the wild-type background due to 

unmutagenized plasmids in the subsequent functional 
screens. Although several existing procedures are 
highly efficient with respect to the frequency of 
mutagenesis, a residual portion of the plasmids produced 

15 will still be unmutated. If the mutation frequency is, 
say, 90%, and if 10 s transf ormants are produced, then 10 4 
will be wild type. After transformation of yeast, the 
wild-type transf ormants can make it very difficult to 
select the desired mutants in the subsequent screens. 

20 To overcome this problem, combinations of stop codons 
and restriction sites were introduced at each position 
in the PRC1 gene that was to be targeted by a degenerate 
oligonucleotide. The stop codons ensure that the e.g. 
10% unmutated plasmids will encode inactive CPD-Y, while 

25 the introduced restriction sites allow us to monitor the 
mutation frequency. The mutagenesis itself is performed 
according to the protocol of Lewis et al . ( Nucleic Acids 
Res . , 18:3439 (1990)) on single-stranded phagemid DNA 
carrying a frameshift mutated ampicillin resistance gene 

3 0 and the nonsense mutated prcl gene. Mutant strand 

synthesis is primed in a single reaction with a number 
of degenerate oligonucleotides to produce mutations in 
the PRC1 gene and an oligonucleotide that repairs the 
ampicillin resistance gene. The resulting DNA is used 

35 to transform a mismatch- repair deficient E. coli strain 
to ampicillin resistance. Propagation in ampicillin- 
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containing medium selects for progeny of the mutant 
strand. 

Two prcl deletion strains of yeast were 
transformed with the plasmid population. In one strain, 
5 CPD-Y enters the vacuole, while in the other strain CPD- 
Y is missorted, secreted and activated extracellularly 
due to a vps mutation as described by Nielsen et al . , 
Appl. Microbiol. Biotech. , 33 :307 (1990) . 

The vps strain, which requires leucine for 

10 growth, was used to search for desired mutants in a 

single direct screen. The transformed cells were plated 
on synthetic medium lacking leucine but containing one 
of various N-blocked-X-Leu-OH dipeptides. Only cells 
which express a protease that can release the terminal 

15 leucine can grown on these plates. 

The Vps* strain was used to screen for desired 
mutants in two steps. First, transformant colonies were 
overlaid by agar containing a chromogenic substrate 
which reveals CPD-Y activity. In the second step, the 

20 CPD-Y activity from the positive colonies was estimated 
towards different substrates in a chromogenic microtiter 
dish assay. 

By this procedure, a total of >10 s independent 
mutants was produced, some of which exhibit increased 
25 activities (k CJC /KJ toward certain substrates by a factor 
of up to 150. A number of these mutant plasmids have 
been sequenced and the encoded enzymes have been 
purified and characterized kinetically. 

30 Reagents : 

CBZ-X-Leu-OH peptides and N-acetyl-L-alanine 
0-naphthyl ester (AANE) were from Bachem; horseradish 
peroxidase type I, Crotalus atrox L-amino acid oxidase 
type VI, o-dianisidine and Fast Garnet Red GBC salt were 
35 from Sigma. Oligonucleotides were synthesized on an 
Applied Biosystems 3 94 DNA-RNA Synthesizer. LB, 2xYT 
and SOC medium were prepared according to Sambrook et 
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al., Molecular Cloning. A Laboratory Manual . Cold Spring 
Harbor, NY (198 9) . SC and YPD medium were prepared 
according to Sherman, Methods Enzvmol . , 194:3-21 (1991), 
with slight modifications (Tullin et al., Yeast , 7:933- 
5 941 (1991) ) . 

Strains : 

E. coli BMH71-18mutS (thi supE A(lac-proAB) 
[mutS::TnlO] F' [proAB* laql q lacZAMIS]) (Kramer et al . 
Cell . 18:879 (1984); Zell et al. EMBO J . , 6:1809 

(1987) ); E. coli JM109 (recAl supE44 endAl hsdR17 
gyrA96 relAl thi A(lac-proAB) F' [traD36 proAB* laql q 
lacZAMIS]) (Yanisch- Perron et al . , Gene , i3:103 (1985)); 
S. cerevisiae JHRY20-2CA3 (MATa Aprcl leu2-3 leu2-112 
ura3-52 his3-A200 prcl - A3 : : HIS3 ) (Blachly-Dyson et al . , 
J. Cell Biol . , 104:1183 (1987)); S. cerevisiae W2579 
(MATa Aprcl leu2-3 leu2-112 ura3-52 vpll-1) (Nielsen et 
al. (1990) cited supra . ) . The vpll gene has been 
renamed vpsl (Robinson et al., Mol . Cell . Biol . . 8:4936 

(1988) ). K2579LLR was isolated in the present study as 
a spontaneous mutant of W2579 that requires less leucine 
for growth. 

Plasmida; 

25 The 2.2 kb EcoRI 2\l fragment taken from YEp24 

(Botstein et al., Gene . £:17-24 (1979)) was blunted and 
inserted into the blunted Clal site of pSELECT (Lewis et 
al., 1990 cited supra . ) in the orientation that places 
the HINDIII site close to lacZ. The 1 . 1 kb Hindlll URA3 

3 0 fragment taken from YEp24 was blunted and inserted 

clockwise into the blunted Styl site of the resulting 
plasmid to produce pYSEL. The Sall-PvuII fragment of 
pWI3 (Winther et al . , Eur. J. Biochem. . 179:681 (1991)) 
with the PRC1 gene under control of its own promoter was 

35 blunted and inserted clockwise into the blunted Sall- 
BamHI fragment of pYSEL to produce the phagemid shuttle 
vector pYSPl. A combined stop codon/restriction site 



15 
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was introduced into pYSPl at each of the four selected 
targets for mutagenesis of the PRC1 gene {Table VIII) . 
The resulting plasmid, pYSP32, carries ori, tet\ bla~, 
the fl ssDNA replication origin, a functional fragment 
5 of 2/i, URA3 and prcl. Single-stranded DNA produced from 
this phagemid is complementary to the sequence indicated 
in Table VIII. 



WO 95/20039 



PCT/US95/06682 



73 



H 
H 
> 

W 

m 



0) 

c 

M 

Tl 0 



2 5 
■So 



8- 
g. * 

® c 
p 

2 



o 
o 



u 
u 
< 

H H 



CD 



o a « 

m < h 

U 



H 
O 
H 

vo H D) 
H O U 

(N U < 

H <H 
CM O O 

o 

H 



U 
< 



U 
H 
U 

O 

o 
o 



r4 

x: 
u 
o 



u 
u 
< 

< 

o 

u 

< 



H 
O 
H 

O 

O l"H 

O m 



u 
u 
u 

o 



E 
CO 



H 

U 
U 

H 
as O 

an 

o • 



CO 

© 
*0 



0 
0) 
«H 
U 

c 

0 
0) 



OOOOi— iooo 
lo r-t h in 



UUUUUUUU 
UUUUUUUU 
<<<<<<<< 




UUUUUUUU 



+ + + + + + + + 



OOUUOOUU 



U 2 U 



U 



OOUUUUOO 



+ + + + + 



UUUUUUUU 
<<<<<<<< 



UUUUUUUU 
UUUOUOUU 
UOUUUUUU 







U 




u 




o 


UUUUUUUU 






u 




u 






UUUUUUUU 


>■ 




u 






»— ( 


c 


UOOOOUUU 


a, 










1— 1 


0J 


u 




u w 








0) 


ujsouuuuu 










U 


T3 


03 










m 


O 


C 


JJ 




c 






a, 




-H 


3" 






CC 


CO 

>< 


o> < 
^ < 




e 


UUUUUUUU 
UUUUUUUU 


rH 




H 


a 






M-l 




u 








o • 




0 


o: 






U-l 


0 • 








a 






O 






c 

0 




<u 






0) 






-w 




a 






c 






XJ 




5s 






0) 






(d 




jj 






Ol 






C 


ajwcowmcnrocn 

0)0)0)0)0)0)0)0) 








iH 








-H -H -H -H -H -H -W -H 








u 






I 




Pl-H 






in 






0 


0)0)0)0)0)0)0)0) 


O 2 






a 






tj 


cocococococococo 



0) 


0 


jC 




JJ 


Q> 


iJ 


P* 

0) S 


nj 


U jJ 




a) 


01 






o 


CO 




a) 


0) a) 




n 


jj 


m E 




jO 0 






00 


a> vm 
jj 


a) 


at oo 


TJ 


u u 


■H 


(U c 


JJ 


C fO 


0 


a) £ 


0) 


o> ^ 


rH 


a) o 


O 






0) 


C 


a) c 


0 


£ rn 






Jj 




T3 


O 


C JJ >. 




ro in to 




<T3 01 


c 


Oi <l) o] 




c >« <0 


}-i 


0 




HIW >, 


TJ 


O flJ 




0) — t 


XI 


0) C 


<U 


CO O QJ 




ro > 




ja jj O 




L) 


to 


m aj u 


(d 


m V-i 0 








0) O 


10 


U 4) O 


0) 


0) £ 


*o 


3 JJ 0) 


-H 


jr 


XJ 


OQ 0) JJ 


0 


0) 0) 


0) 


XJ JJ c 


rH 


*H CO **H 


O 


JJ u 


3 


O "H >, 


C 


O T3 JJ 




H C'H 




u > 


3 


3 --h 


O 


CCu 




O E O 




Ol 3 ro 


rH 


-W rH 


rH 


rH O 0) 


CO 


O O 






*4 


U JJ ro 


O £ D 




C Ol O 


a> 


flJ-H CJ 


u 


Ol w JJ 


3 


fO 0) 


JJ 


jJ ft) T3 


X 


3 J= 


•H 


Eh£ 


£ 


jj 




r-t -H 


«-H 


rH - ^ 




< W 


& 


O >* 




JJ < 


0) 


• c a 




c « a 


c 


0 u o 


(0 


-#H 




JJ 0) to 


JJ 






CI JJ 0) 


J3 


O U 


jJ 


a jj a 




m X 


0) 


T) ft) 


0) 


O TJ 


JJ 


jJ ft) ft) 


nj 


(U JJ X 


u 


U CO JJ 


-•H 


•H 3 


TJ 


TJ JJ 2 


C 


C -H 5 


*H 


H OJ Q 



WO 95/20039 PCT/US95/06682 



74 

Trans f ormation : 

Transformation of E. coli was performed with a 
Bio-Rad Gene Pulser set at 25 tiF , 200 Q and 2.5 kV in 
2 mm cuvettes according to Dower et al., Nucleic Acid 
5 Res . , 16:6127 (1988). Transformation of yeast was 

performed according to Schiestl and Gietz, Curr . Genet . . 
16:339 (1992) . 

Preparation of single- stranded phagemid DNA: 

E. coli JM109 transformed with pYSP32 was 
grown to an OD fi00 of 0.5 in 2 x YT + 50 mg/1 
tetracycline. One milliliter of this culture was 
superinf ected with 20 pi of a >10 9 p.f .u./ml M13K07 
helper phage stock in a 500 ml Erlenmeyer bottle. After 
incubation for 1 hour at 3 7°C, 200 ml 2 x YT + 50 mg/1 
tetracycline + 50 mg/1 kanamycin was added. After 
incubation with agitation overnight at 37°C / ssDNA was 
purified by standard procedures (Sambrook et al., A 
Guide to Molecular Cloning (1989) ) . Kanamycin selects 
for cells superinf ected with helper phage M13K07. 

Mutagenesis ; 

Mutagenesis was performed according to the 
protocol of Lewis et al. (1990) (cited supra ) with 
25 slight modifications. Single-stranded pYSP32 (0.2 pmol) 
was mixed with 0.5 pmol ampicillin-repair 
oligonucleotide (Promega, Altered Sites Kit) and 2 pmol 
of each mutagenic degenerate oligonucleotide in 80 ^1 2 
x annealing buffer (20 mM Tris-HCl pH 7.5, 10 mM MgCl 2 , 
30 50 mM NaCl) , heated to 70°C for 5 minutes and allowed to 
cool slowly (- 1 hour) to room temperature. After the 
annealing reaction 10 /-il 10 x synthesis buffer (100 mM 
Tris-HCl pH 7.5, 5 mM of each of the four dNTPs, 10 mM 
ATP, 20 mM DTT) , 10 Weiss units T4 ligase (New England 
35 Biolabs) , 20 units T4 DNA polymerase (Promega) and H 2 0 to 
100 fil were added. The polymerase/ ligation reaction 
was incubated at 3 7°C for 2 hours. Subsequently the 
reaction mixture was phenol extracted, ethanol 
precipitated, resuspended and used to transform electro- 



15 
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competent E . coj-i BMH71-18mutS cells. After incubating 
the transformed cells for 45 min at 37°C in SOC, an 
aliquot was plated on LB plates with 60 mg/1 ampicillin 
to determine the number of transf ormants , while the rest 
5 of the cells were grown overnight in 50 ml LB with 60 
mg/1 ampicillin. A plasmid preparation from this 
primary culture will contain up to about 50% ampicillin 
sensitive plasmids originating from the unmutated parent 
strand. To enrich for the mutants, l miniprep DNA 
10 from this culture was used for a secondary 

transformation of JM109. One microgram of miniprep DNA 
from the JM 109 transformant culture was used to 
transform the yeast strains K2579LLR and JHRY20-2CA3 . 

DNA sequencing was performed by the Applied 
Biosystems dsDNA Taq DyeDeoxy 1 " terminator procedure for 
use with the Applied Biosystems Model 373A DNA 
sequencing system. 

20 

Plate act ivity screen t 

Yeast strain K2579LLR was transformed with the 
mutated population of pYSP32 and plated on SC-ura-leu 
supplemented with 1.5 mM of a poor CBZ-X-Leu-OH peptide 

25 substrate as an enzyme -dependent leucine source. P x of 
the substrate was either Glu, Gly, Ser, His, Pro, Trp or 
Lys. The cells can grow on this medium only if they 
express proteases capable of releasing the C-terminal 
leucine. However, CPD-Y is not the only protease 

30 secreted by the cells that can catalyze this cleavage. 
Thus, a minimum level of CPD-Y activity is necessary not 
only to release sufficient leucine for growth, but also 
in order to discriminate CPD-Y dependent growth from the 
background growth. Based on experiments with 

35 characterized CPD-Y mutants, it is estimated that the 
kcat/Km'«»ust be above -10,000-25,000 min^mM' 1 , dependent 
on the substrate. 
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Color overlay screen: 

Yeast strain JHRY20-2CA3 was transformed with 
the mutated population of pYSP32 and plated on SC-ura 
plates, which were incubated at 3 0°C until colonies 
5 reached a diameter of 2-3 mm. Then each plate was 

overlaid with a fresh mixture of 3 ml 0.6% agar in water 
at 50°C and 2 ml dimethylf ormamide containing 0.25% AANE 
at room temperature. After incubation for 5 min at room 
temperature, 5 ml 0.4% Fast Garnet Red GBC salt in 10 mM 

10 sodium phosphate pH 7.0 buffer was poured on top; after 
5 min incubation the plates were then rinsed in tap 
water. Colonies expressing active CPD-Y appear red, 
while those lacking vacuolar CPD-Y activity appear white 
(modified from Jones, 1977) . Colonies expressing active 

15 CPD-Y were isolated by streaking onto SC-ura plates. 

Coupled assay screen; 

After incubation for 3-4 days on SC-ura plates 
of JHRY20-2CA3 transf ormants expressing active mutant 

20 CPD-Y, -1 mm 3 of cells was picked with a toothpick and 
suspended in 200 /il water. After aliquoting 20 ^1 cell 
suspension into each of eight microtiter wells, 100 pi 
substrate mixture (50 mM MES pH 6.5, 1 mM MnCl 2 , 50 fig /ml 
peroxidase, 100 fig /ml L- amino acid oxidase, 100 fig /ml o- 

25 dianisidine and 1 mM CBZ-X-Leu-OH) were added. P 1 of the 
substrate (X) was either Phe, Glu, Gly, Ser, His, Pro, 
Trp or Lys. The plates were incubated at room 
temperature, and OD 560 was measured at times 0, 1 and 16 
hour on a Perkin Elmer Lambda Microplate Reader 

30 (modified from Lewis and Harris, 1967, cited supra ) . 

The activity of each mutant was normalized to 
that of the wide type as follows. First, all absorption 
values were corrected by subtracting the background 
(absorption at same time point in wells without cells) . 

35 The difference in correction absorption between two time 
points (corresponding to the amount of hydrolysis) was 
then normalized to account for variations in cell 
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number, by dividing by the OD of the cells (OD of well 
with cells at time point 0 minus OD of well without 
cells at time point 0) . Finally, the obtained activity 
estimate was divided by the corresponding estimate for 
5 the wild type. Time point 0 and 1 hour were used to 
calculate the relative activity towards CBZ-Phe-Leu-OH, 
while time points 0 and 16 hour were used to calculate 
the relative activity towards all other substrates. 

10 Isolation and kinetic characterization 
of mutant forms of CPD-Y r 

Over- express ion of selected mutant enzymes was 

performed according to Nielsen et al. (1990) cited 

supra . The mutant BamHI-BamHI PRC1 fragments of pYSP32 

15 were inserted into the GAL expression vector pRA21 and 
introduced into yeast strain K2579LLR. The plasmid 
pRA21 is derived from p72UG (Nielsen et al. (1990) cited 
supra ) by replacing the 918 bp Bglll-Sall fragment with 
the €38 bp Bglll-PvuII fragment of pWI3, thereby 

20 deleting the BamHI site downstream of the PRC1 gene. 
Mutant forms of CPD-Y were purified from culture 
supernatants by affinity chromatography as described in 
Example 1 . Kinetic parameters were determined by 
measuring rates of hydrolysis at 25°C of CBZ-X-Leu-OH 

25 substrates spectrophotometrically at 224 nm in 10 ttiM 
sodium phosphate buffer pH 6.5. The concentrations of 
substrates ranged from 0.01 to 0.5 mM. Accurate 
measurements were not possible with substrate 
concentrations exceeding 0.5 mM due to the high 

30 absorption of the substrate. 

By studying the crystal structure of the wheat 
carboxypeptidase CPD-W (Liao et al (1990) cited supra ) . 
the amino acid positions Pro60, Tyrl56, Leul87, Phe224, 
Ile225, Val334 and Val335 were found to constitute the 

35 bottom, end and sides of the S x binding pocket of 

carboxypeptidase W, These positions of CPD-W correspond 
to amino acid positions Pro54, Tyrl47, Leul78, Glu215, 
Arg216 p Ile340 and Cys341 of CPD-Y according to an 
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alignment of the primary structures of the two enzymes 
(Breddam et al. (1987) cited supra Sorensen et al., 
Carlsbera Res, rnmmnn , 52:285 (1987)). We hypothesized 
that these amino acid residues constitute the S x binding 
5 pocket of CPD-Y. As Pro60 contributes very little to 
the surface of S 1 of CPD-W, Pro54 of CPD-Y was not 
included in any of the mutagenesis experiments, and as 
. Tyrl56 is next to the active Serl55 of CPD-W and its 
contribution to the surface of S 1 is also minor, Tyrl47 
10 of CPD-Y was only included in one mutagenesis 
experiment . 

The crystal structure of yeast 
carboxypeptidase Y (Endrizzi et al.. Biochemistry . 
33:11106 (1994)) confirms that this model is correct in 
15 many of its features. The model correctly predicts 4 of 
9 residues in the S a binding site. Pro54 of 
carboxypeptidase Y is predicted to be at most a marginal 
part of the S x binding site; it is in fact absent from 
the S x site. The model that used the crystal structure 
20 of wheat carboxypeptidase -W and the sequence of yeast 
carboxypeptidase Y yielded accurate guidance for 
construction of mutants of carboxypeptidase Y. 

Eight different series of mutagenesis of 
pYSP32 were carried out, randomly mutating various 
25 numbers of codons, from 1 to 6, simultaneously (Table 
VIII) : series 1, all codons reverted to wild type; 
series 2, positions 147, 178, 215, 216, 340 and 341 
degenerated; series 3, positions 178, 215 and 216 
degenerated; series 4, positions 178, 340 and 341 
30 degenerated; series 5, positions 215, 216, 340 and 341 
degenerated; series 6, position 178 degenerated; and 
series 8, positions 340 and 341 degenerated. In all 
series, all mutant codons present in pYSP32 that we not 
subjected to random mutagenesis were reverted to wild 
35 type (Table VIII) . All degenerate mutagenic 

oligonucleotides were designed relatively long (33 
bases) to minimize biased annealing of oligonucleotides 
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complementary to the introduced stop codons and 
restriction sites. 

A sensitive two-step screening strategy was 
employed. First, to differentiate transf ormants 
5 expressing active CPD-Y , we have used a modification of 
the overlay procedure of Jones (1977) for staining CPD-Y 
containing yeast colonies. Instead of using APNE 
(N- acetyl -DL-phenylalanine 0-naphthyl ester) as a CPD-Y 
substrate, we have used AANE, since we find that the 

10 background staining of inactive CPD-Y transformants is 
much lower with AANE, thus making it possible to 
identify transformants with low CPD-Y activity. 
Furthermore, it was anticipated that most active CPD-Y S x 
mutants would accommodate the smaller P x side chain of 

15 AANE better than the larger side chain of APNE. 

Colonies of yeast strain JHRY20-2CA3 
transformed with DNA from all mutagenesis series were 
screened by this assay. In the wild type control 
mutagenesis, series 1, half of the transf ormant colonies 

20 express active CPD-Y (Table VIII) . If the mutagenesis 
event at each oligonucleotide target is independent of 
that of the other three targets, this number corresponds 
to a mutation frequency of 84% at each target. Of 10 s 
tested transformants from mutagenesis series 2, none had 

25 detectable CPD-Y activity. In this series, six codons 
were mutated simultaneously. Fewer codons were mutated 
in series 3-6 and 8, which yielded between 0.4 and 10% 
positive transformants, expressing a wide range of CDP-Y 
activities as indicated by the color intensities in the 

30 overlay assay. Mutagenesis series 7 yielded 50% 

transformants expressing active CPD-Y and the level of 
activity of all transformants was indistinguishable from 
that of the wild type transformants, suggesting that 
positions 215 and 216 can be varied with little effect 

35 on activity towards AANE. 

In the second step, we estimated the CPD-Y 
activity of positive mutants toward eight N-blocked 
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dipeptides (listed in Table IX) , by a modification of a 
chromogenic microtiter dish assay, involving L-amino 
acid oxidase, peroxidase and o-dianisidine, previously 
used to monitor carboxypeptidase S activity (Lewis and 
5 Harris, 1967; Wolf and Weiser, 1977) . 
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TABLE IX 



Estimated GPD-Y Activities of Selected Mutants, 
Relative to Wild Type Transf onaants. Towards 
an Initial Concentration of 
lmM of Eight CBZ-X-Leu-OH Substrates 



Pi side chain of substrate 



Lys 


Phe 


Glu 


Glv 


Ser 


Pro 


Trn 


His 


Wide- type CPD-Y 


1 


1 


1 


1 


1 


1 


1 1 


Mutations : 
















178Ser • 


0 . 7 


1.3 


1 . 1 


Q . 6 


1 . 8 


1 . 2 


1 . 1 


9.2 
















178Cys 


1.5 


1.1 


0.9 


0.5 


0.3 


1.0 


1.7 


6.9 
















178Cys/215Thr/2i6Trp 


1.0 


1.1 


0.8 


0.4 


1.6 


0.9 


1.1 


5.3 
















178His/215Pro/216Thr 


0.9 


1.0 


0.8 


1.1 


1.2 


1.5 


1.1 


3.6 
















178Ala/215Ser/216Arg 


1.2 


1.4 


1.1 


1.3 


0.6 


1.1 


1.0 


4.0 
















178Ser/215Ala/216Pro 


1.5 


2.0 


0.6 


0.7 


2.8 


1.9 


0.9 


8.0 
















178His/215Thr/216Phe 


0.9 


1.2 


0.8 


1.0 


1.2 


1.7 


1.1 


3.1 
















178Asn/215Ala/216Gly 


2.8 


2.0 


0.7 


0.7 


0.9 


2.0 


1.3 


4.4 
















178Gly/340Ile/341Cys 


0.8 


0.8 


0.3 


0.3 


0.5 


0.5 


0.8 


4.6 

















Relative activities >2.0 are emphasized in bold print 
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Trans formants of yeast JHRY20-2CA3 expressing 
active CPD-Y from mutagenesis series 3-8 were tested in 
this assay using an initial substrate concentration of 
1 mM. Compared with the wild type, transf ormants from 
5 mutagenesis series 7 all exhibited activity levels 
around 100% towards all eight tested CBZ-X-Leu-OH 
substrates. Neither very low nor very high activity 
levels were observed, suggesting that positions 215 and 
216 have little influence on P x preference. 

10 Transf ormants from mutagenesis series 4, 5 and 8 

generally exhibited low and varied activity levels. 
Most of these exhibited between 10 and 100% CPD-Y 
activity and none exhibited >200% activity. We conclude 
that Ile340 and/or Cys341 are important for K^k^ or 

15 production of the enzyme. Transf ormants from 

mutagenesis series 3 and 6 showed a broad range of 
activities with an average around 100%. Several mutants 
were fund with very low activity levels, just as several 
were found with very high activity levels towards one of 

20 the substrates, CBZ-Lys-Leu-OH. In one case the 

activity was 9 times the activity of the wild type. To 
determine which amino acids had been substituted in a 
number of the most interesting mutants, plasmid DNA was 
recovered and sequenced (Table IX) . The mutant 

25 exhibiting the highest increase in activity towards CBZ- 
Lys-Leu-OH was found to have Leul78 substituted with 
Ser . 

To determine the kinetic parameters of the 
mutants exhibiting the highest increase in activity, 
30 CPD-Y was purified from the mutants 178Ser and 

178Ser/215Ala/216Pro by affinity chromatography and 
analyzed kinetically (Table X) . 
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TABLE X 

Kinetic Parameters of Two Selected Mutant CPD-Y Forms 
Compared with the Wild Type 



Substrate CPD-Y K,, K*z/** 

(mM/min/mM) (mM) (min'HnM* 



CBZ-Phe-Leu-OH 


Wild type 


13600 


0.036 


380,000 










178S 


12000 


0.060 


200, 000 










178S/215A/216P 


5600 


0.053 


110,000 








CBZ-Lys-Leu-OH 


Wild type 


nd 


nd 


24 










178S 


nd 


nd 


3,700 










178S/215A/216P 


nd 


nd 


2,200 








CBZ-Ser-Leu-OH 


Wild type 


800 


0.070 


12,000 










17BS 


420 


0.070 


6,000 










178S/215A/216P 


240 


0.080 


3,000 








CBZ-Ala-Leu-OH 


Wild type 


-25000 


-0.7 


36,000 










178S 


nd 


nd 


4,800 










178S/215A/216P 


nd 


nd 


2,400 








CBZ-Glu-Leu-OH 


Wild type 


nd 


nd 


440 










178S 


nd 


nd 


720 










178S/215A/216P 


nd 


nd 


500 








nd: Values could not 


be determined as 


K,, is >0.5 


mM. 
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The kcat/K,, values of these enzymes towards *CBZ-Lys -Leu -OH 
are 3700 and 2200 min^mM" 1 respectively, corresponding to 
a 155- and 91-fold increase relative to the wild type 
value of 24 min* 1 ^' 1 . These results confirm the 
5 importance of the 178Ser mutation and the smaller effect 
of changes in the side chains at positions 215 and 216. 

By introducing stop codons at all targets for 
random mutagenesis, the wild type background normally 
present due to unmutagenized plasmids was eliminated, 

10 and the wild type PRC1 gene can now only occur by rare 
mutational reversion. The frequency of such revertants 
will depend on the mutagenesis efficiency, on how many 
codons are mutagenized simultaneously and on the number 
of codons synonymous to the mutagenized codons. With an 

15 overall mutagenesis efficiency of 50%, theoretical 

reversion frequencies of 6 x 10' 9 , 1 x 10" 4 , 7 x 10" 5 , 2 x 
10" 6 , 5 x 10* 2 , 1 x 10° and 7 x 10" 10 would be expected for 
mutagenesis series 1-8 respectively. These low 
frequencies of wild types enabled selection for rare 

20 mutants with increased activity. Without the 

elimination of the wild type background, it would have 
been difficult to select the desired mutants from a pool 
of perhaps 0.5% active mutants and 50% unmutagenized 
wild type transf ormants . By this strategy the binding 

25 site of any enzyme that has a functional screen can be 
dissected, amino acid by amino acid, to determine which 
positions are productive mutagenesis targets. 
Furthermore, the introduction of a rare restriction site 
at the oligonucleotide target prior to the mutagenesis 

30 simplifies the task of confirming the mutation and 
reduces sequencing to a minimum. 

The results suggest that Leul78, Ile340 and 
Cys341 contribute to P x specificity, conceivably by 
forming part of the surface of the S x binding pocket. 

35 All CPD-Y mutants with increased activity towards Lys in 
P x were found to have mutations at position 178. 
Surprisingly, substituting Leul78 by Ser, Cys, Ala, Gly, 
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Asn or His, all give the same general effect - more 
activity towards substrates with Lys in P x (Table X) . An 
explanation for the increased activity towards Lys in P x 
of CPD-Y- 178 mutants might be that the introduced 
5 mutation causes a conformational change of whichever 

residues constitute the end of S x in CPD-Y, corresponding 
to 224 and 225 of CPD-W. 

CPD-Y can catalyze the transpeptidation of 
proline insulin (INS-Pro-Lys-Ala-OH) to produce human 

10 insulin amide (INS-Pro-Lys-Thr-NH 2 ) , and it has 

previously been shown that the Cys341 Hg 2 * modified 
enzyme, which has 15 times more activity towards Lys in 
P w produces 26% human insulin amide product in a 
transpeptidation reaction (Breddam et al. 1984), whereas 

15 the wild type enzyme only produced 4.2%. Thus, the 
178Ser mutant, with its 155-fold increase in activity 
toward Lys in P x , might give even higher yields in such a 
transpeptidation reaction. 

Mutants selected for the ability to hydrolyze 

20 a poor CBZ-X-Leu-OH peptide substrate as described 
herein can also be screened for the ability to 
transacylate preselected polypeptide substrates and 
nucleophiles as described in Examples 2 and 3 . 

All publications and patent applications in 

25 this specification are indicative of the level of 
ordinary skill in the art to which this invention 
pertains. All publications and patent applications are 
herein incorporated by reference to the same extent as 
if each individual publication or patent application was 

30 specifically and individually indicated by reference. 

It will be apparent to one of ordinary skill 
in the art that many changes and modifications can be 
made in the invention without departing from the spirit 
or scope of the appended claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Carlsberg A/S 

(B) STREET: Vasterf aelledve j 100 

(C) CITY: Copenhagen V 

(E) COUNTRY: Denmark 

(F) POSTAL CODE (ZIP) : DK-17 99 

(A) NAME: Breddam, Klaus 

(B) STREET: Sondervangsvej 28B 

(C) CITY: Glostrup 
(E) COUNTRY: Denmark 

<F) POSTAL CODE (ZIP) : DK-2600 

(A) NAME: Kielland-Brandt , Morten C. 

(B) STREET: Lundbyesgade 3 

(C) CITY: Copenhagen V 

(E) COUNTRY: Denmark 

(F) POSTAL CODE (ZIP) : DK-1771 * 

(A) NAME: Mortensen, Uffe Hasbo 

(B) STREET: Nyrnberggade 6, 3 TV 

(C) CITY: Copenhagen S 

(E) COUNTRY: Denmark 

(F) POSTAL CODE (ZIP) : DK-2300 

(A) NAME: Olesen, Kjeld Ove 

(B) STREET: Nitiveg 9, 3th 

(C) CITY: Frederiksberg 

(E) COUNTRY: Denmark 

(F) POSTAL CODE (ZIP) : DK-2000 

(A) NAME: Stennicke, Henning Ralf 

(B) STREET: Sindshvileve j 3 A, 4 TV 

(C) CITY: Frederiksberg 

(E) COUNTRY: Denmark 

(F) POSTAL CODE (ZIP) : DK-2000 

(A) NAME: Wagner, Fred W. 

(B) STREET: Route 1, Box 77B 

(C) CITY: Walton 

(D) STATE: Nebraska 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 68461 

(ii) TITLE OF INVENTION: Customized Proteases 
(iii) NUMBER OF SEQUENCES: 31 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US (Unknown) 

(B) FILING DATE: 27-OCT-1994 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/144,704 

(B) FILING DATE: 28-OCT-1993 
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(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: GLP-1 (glucagon- like peptide-1) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



His Asp Glu Phe Glu Arg His Ala Glu Gly Thr Phe Thr Ser Asp Val 
15 10 15 



Ser Ser Tyr Leu Glu Gly Gin Ala Ala Lys Glu Phe lie Ala Trp Leu 
20 25 30 



Val Lys Gly Arg 
35 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: GRF (1-43) -Ala (Growth hormone releasing factor) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Tyr Ala Asp Ala lie Phe Thr Asn Ser Tyr Arg Lys Val Leu Gly Gin 
15 10 15 

Leu Ser Ala Arg Lys Leu Leu Gin Asp lie Met Ser Arg Gin Gin Gly 
20 25 30 



Glu Ser Asn Gin Glu Arg Gly Ala Arg Ala Arg Ala 
35 40 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: GRF (1-24) (Growth hormone releasing factor) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Tyr Ala Asp Ala lie Phe Thr Asn Ser Tyr Arg Lys Val Leu Gly Gin 
15 10 15 



Leu Ser Ala Arg Lys Leu Leu Gin 
20 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Oligo N51Q 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GGATCCGGTC ATCCTTTGGT TGCAAGGGGG T 31 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(DJ TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



TAAGCTTCC 



9 
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(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 
TGAATTCCT 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

£ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 
TAGCCCGGGT GT 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 
TAAATCGATA CC 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9 
TCCTACGCC 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
TTGGAACGTT GT 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TTCATCTGTA CC 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GGCCTCACT 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Oligo wtll 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GTTTCTGTCC TTGTGAGACA AAATTTCAGA 
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(2) INFORMATION FOR SEQ ID. NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Oligo W49F 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
GGATCCGGTC ATCCTTTTCT TGAACGGG 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Oligo N51A 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
'CAAAGGATC CGGTCATCCT TTGGTTGGCA GGGGGCCA 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE; Oligo pUC19 spl 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16 
GCAAGGCGAT TAAGTTGGGT 
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(2) INFORMATION FOR SEQ ID. NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Oligo E145A 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:17: 
GGCGTAGGAA GCCCCAGCGA T 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Oligo E65A and olago wtl 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

21 

CTGTTCTTTG CATTAGGACC C 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE : 

(B) CLONE: Oligo N51D 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GCAAAGGATC CGGTCATCCT TTGGTTGGAC GGGGGCCA 



38 
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(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Oligo N51E 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
GGATCCGGTC ATCCTTTGGT TGGAAGGGGG CCA 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Oligo N51Q 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 
GGATCCGGTC ATCCTTTGGT TGCAGGGGGT 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Oligo N51S 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 
GGATCCGGTC ATCCTTTGGT TGAGTGGGGG T 
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(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Oligo N51T 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23 
GGATCCGGTC ATCCTTTGGT TGACTGGGGG T 



(2) INFORMATION FOR SEQ ID NO: 24: 

£i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Oligo E145D 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 
GGCGTAGGAA TCCCCAGCGA T 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25 
GGCGTAGGAA TTCCCAGCGA T 
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(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Oligo E145Q 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 
GGCGTAGGAT TGCCCAGCGA T 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Oligo E145S 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 
GGCGTAGGAT GACCCAGCGA T 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 
TCCNNNGCC 
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(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 
TCCNNNGCC 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 
GGCNNNACT 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 
TTCNNNNNNA CC 
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WHAT IS CLAIMED IS: 

1. A customized protease derived from a known 
protease, wherein the customized protease has a 
modified active site and is capable of catalyzing a 
transacylation reaction in which a preselected 
substrate is modified with a preselected 
nucleophile, and wherein the same transacylation 
reaction is not substantially catalyzed by the 
known protease. 

2. The customized protease of claim 1, wherein the 
customized protease is capable of transacylating at 
the C-terminal end of the preselected substrate. 

3. The customized protease of claim 1, wherein the 
customized protease is capable of transacylating at 
the N-terminal end of the preselected substrate. 

4. The customized protease of claim 1, wherein the 
preselected nucleophile is a single or multiple 
amino acid unit. 

5. The customized protease of claim 1, wherein the 
customized protease is derived from a known 
exopeptidase . 

6. The customized protease of claim 1, wherein the 
customized protease is derived from a known serine 
or cysteine protease. 

7. The customized protease of claim 1, wherein the 
customized protease is derived from a known 
carboxypeptidase . 
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8. The customized protease of claim 1, wherein the 
customized protease is derived from a known 
endoprotease . 

9. The customized protease of claim 1, wherein the 
preselected substrate has a basic or acidic 
penultimate amino acid. 

10. The customized protease of claim 1, wherein the 
preselected nucleophile is selected from the group 
consisting of an acidic amino acid amide, a basic 
amino acid amide, a peptide amide, an amino acid 
and an amino acid ester. 

11. The customized protease of claim 1, wherein the 
preselected substrate corresponds to GRF (1-43) -Ala 
and the preselected nucleophile is a leucine amide. 

12. A customized carboxypeptidase derived from a known 
carboxypeptidase, wherein the customized 
carboxypeptidase has a modified active site and is 
capable of catalyzing a transacylation reaction in 
which a preselected substrate is modified by a 
preselected nucleophile, wherein the preselected 
substrate has an acidic or basic amino acid at the 
penultimate position and the preselected 
nucleophile is an amino acid amide. 

13. The customized carboxypeptidase of claim 12, 
wherein the preselected substrate has an arginine 
at the penultimate position. 

14. The customized carboxypeptidase of claim 13, 
wherein the preselected substrate is GRF (1-43) -Ala 
and the preselected nucleophile is leucine amide. 
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15. A customized protease of claim 12, wherein the 
modified active site has a different amino acid at 
the S 1 binding site than the known carboxypeptidase. 

16. A customized protease according to claim 12, 
wherein the modified active site has a different 
amino acid at the S 1 f site than the known 
carboxypeptidase . 

17. A customized carboxypeptidase derived from a known 
carboxypeptidase, wherein the customized 
carboxypeptidase has a different amino acid residue 
at a position equivalent in the known 
carboxypeptidase to position 178 or 398 of 
carboxypeptidase Y. 

18. A customized protease exhibiting enhanced 
transacylation capability which is derived from a 
known protease, wherein the active site of the 
known protease has been modified so that the 
transacylating activity of the known protease has 
been enhanced. 

19. A customized protease according to claim 18, 
wherein the known protease is a carboxypeptidase 
and the active site has been modified by 
substituting the amino acid residue at a position 
equivalent to 178 in carboxypeptidase Y with 
serine . 

20. A method for preparing a customized protease 
derived from a known protease and that has a 
modified active site and that functions to alter 
the transacylating capability of the known protease 
comprising the steps of: 

(a) providing a DNA sequence that encodes the 
known protease; 
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(b) modifying at least one codon for at least 
one amino acid residue in the active site to form a 
mutant DNA sequence, wherein the modified codon 
encodes a different amino acid residue and wherein 
the different amino acid residue provides a 
customized protease with an altered transacylation 
capability; and 

(c) transforming a suitable host cell with 
the mutant DNA sequence to provide for expression 
of the mutant DNA sequence to yield the customized 
protease. 

21. A method according to claim 20, wherein the DNA 
sequence corresponds to a DNA sequence for 
carboxypeptidase Y. 

22. A method according to claim 20, wherein the DNA 
sequence encodes an endoprotease . 

23. A method according to claim 20, wherein the. amino 
acid is an amino acid at the S x binding site of the 
active site of carboxypeptidase Y. 

24. A method according to claim 23, wherein the amino 
acid is selected from the group consisting of 
Tyrl47, Leul78, Tyrl85, Tyrl88, Asn241, Leu245, 
Trp312, Ile340, Cys341 and mixtures thereof. 

25\ A method according to claim 20, wherein the amino 

acid is an amino acid at the S x ' binding site of the 
active site of carboxypeptidase Y. 

26. A method according to claim 25, wherein the amino 
acid is an amino acid selected from the group 
consisting of Trp49, Asn51, Gly52, Cys56, ThrGO, 
Phe64, Glu65, Glul45, Tyr256, Tyr269, Leu272, 
Ser297, Cys298, Met398 and mixtures thereof. 
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27. A method according to claim 26, wherein the amino 
acid is methionine 398. 

28. The method according to claim 20, wherein the step 
of modifying further comprises: 

(a) deleting the codon for the at least one 
amino acid residue in the active site; and 

(b) replacing the deleted codon with an 
oligonucleotide, wherein the oligonucleotide 
encodes the different amino acid residue. 

29. A method according to claim 20, wherein the step of 
transforming a suitable host cell further 
comprises : 

(a) transforming a strain of S . cerevisiae 
that is capable of secreting proteases; 

(b) selecting the transformed cells for 
expression of the mutant protease; and 

(c) screening the expressed mutant protease 
for the ability to catalyze transacylation 
reactions with a preselected substrate and/or 
preselected nucleophile . 

A method for mutagenizing a known protease to form 
a customized protease having an altered 
transacylation capability comprising the steps: 

(a) providing a DNA sequence for a known 
protease ; 

(b) modifying the DNA sequence of at least 
one targeted position by inserting at the targeted 
position a DNA sequence having a stop codon and/or 
a DNA sequence recognized by a restriction enzyme 
to form a modified DNA sequence encoding an 
inactive form of the known protease, wherein the 
targeted position includes a codon for an amino 
acid in the active site of. the known protease to be 



30. 
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mutated and the stop codon is inserted in place by 
the codon for the amino acid in the active site; 

(c) generating single stranded modified DNA; 

(d) synthesizing and amplifying a mutant DNA 
sequence encoding the customized protease by 
incubating the single stranded modified DNA with 
one or more first degenerate oligonucleotides that 
contain a codon for a different amino acid than the 
amino acid present in the active site and in at the 
targeted site of the known protease; and 

(e) selecting and screening for the mutant DNA 
sequence encoding the customized protease. 

31. A method according to claim 30, wherein the step of 
synthesizing and amplifying includes one or more 
second oligonucleotides, wherein the second 
oligonucleotide includes the codons of the known 
protease at a targeted site and replaces one or 
more of the first degenerate oligonucleotides . 

32. A method according to claim 30, wherein the DNA 
sequence for the known protease substantially 
corresponds to the DNA sequence of carboxypeptidase 
Y. 

33. A method according to claim 32, wherein the 
targeted site encodes an amino acid selected from 
the group consisting of Tyrl47, Leul78, Tyrl85, 
Tyrl88, Glu215, Arg216, Asn241, Leu245, Trp312, 
Ile340, Cys341 and mixtures thereof. 

34. A method according to claim 33, wherein the 
targeted site encodes amino acid 178. 

35. A method according to claim 30, wherein the 
modified DNA sequence has a stop codon selected 
from the group consisting of amber, ochre and opal. 
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36. A method according to claim 35, wherein the 

modified DNA sequence has an opal stop codon and 
restriction endonuclease recognition site for EcoRI 
at the targeted site. 



37. A method according to claim 30, wherein the step of 
generating an effective amount of single stranded 
modified DNA comprises the steps: 

(a) inserting the modified DNA sequence in a 
phagemid shuttle vector, wherein the phagemid 
-shuttle vector has a mutated ampicillin resistance 
gene ; 

(b) amplifying the shuttle vector in a 
suitable host cell; and 

(c) generating single stranded modified DNA. 

38. A method according to claim 37, wherein the step of 
synthesizing and amplifying further comprises one 
or more oligonucleotides that repair the ampicillin 
resistance gene. 

39. A method according to claim 38, wherein the step of 
selecting comprises: 

(a) transforming a suitable host cell, 
wherein the host cell requires at least one amino 
acid to grow; 

(b) selecting the transformed cells that can 
grow on medium deficient in the amino acid required 
for growth supplemented with a preselected peptide 
containing the required amino acid at the 
C-terminus; and 

(c) identifying the cells with the DNA 
sequence encoding the customized protease by 
screening for mutant proteases that are capable of 
transacylating a preselected substrate that is not 
substantially acted upon by the known protease. 
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40. A process for modifying a preselected substrate by 
transacylation using a customized protease derived 
from a known protease comprising: 

(a) incubating a customized protease with a 
preselected substrate and a preselected nucleophile 
to form a mixture, wherein the customized protease 
has a modified active site and wherein the 
preselected substrate and the preselected 
nucleophile are not substantially transacylated by 
the known protease; and 

(b) incubating the mixture sufficiently to 
form a preselected transacylation product. 

41. A method according to claim 40, wherein the 
preselected substrate is growth hormone releasing 
factor with a C- terminal alanine. 

42. A method according to claim 40, wherein the 
preselected nucleophile is a leucine amide. 

43. A method according to claim 40, wherein the 
customized protease is a carboxypeptidase Y having 
a modified active site, wherein the active site has 
a serine at amino acid 178 . 

44. A method according to claim 40, wherein the 
preselected transacylation product is growth 
hormone releasing factor having a leucine amide at 
the C- terminal. 



